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Abstract. We review selected advances in the theoretical understanding of complex 
quantum many-body systems with regard to emergent notions of quantum statistical 
mechanics. We cover topics such as equilibration and thermalisation in pure state 
statistical mechanics, the eigenstate thermalisation hypothesis, the equivalence of 
ensembles, non-equilibration dynamics following global and local quenches as well 
as ramps. We also address initial state independence, absence of thermalisation, 
and many-body localisation. We elucidate the role played by key concepts for 
these phenomena, such as Lieb-Robinson bounds, entanglement growth, typicality 
arguments, quantum maximum entropy principles and the generalised Gibbs 
ensembles, and quantum (non-)integrability. We put emphasis on rigorous approaches 
and present the most important results in a unified language. 
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1. Introduction 

At the time when quantum theory was developed in a creative rush in the last years of 
the twenties of the previous century, classical statistical physics was already a mature 
held of research. The landmark book “Elementary principles in statistical mechanics” jl| 
authored by Gibbs had already been published in 1902, which is seen by many as the the 
birth of modern statistical mechanics j3]- So, as soon as the mathematical framework 
of the “neue Mechanik”, the new mechanics, as von Neumann called it in 1929, was 
established, signihcant efforts were made by him to prove ergodicity and a tendency to 
evolve into states that maximise entropy, which became known as the H-theorem, in this 
setting Q . The held of quantum statistical mechanics soon emerged and can by now be 
considered an important pillar of theoretical physics 0]. 

Still, some foundational yet fundamental questions remain open, much related to 
the questions raised by von Neumann. While maximum entropy principles provide a 
starting point for the understanding of the ensembles of quantum statistical mechanics, 
it seems much less clear how quantum states taking extremal values for the entropy are 
being achieved via microscopic dynamics. After all, at the fundamental level, quantum 
many-body systems follow the Schrodinger equation, giving rise to unitary dynamics. 
It is far from obvious, therefore, in what precise way interacting quantum many-body 
systems can equilibrate. The microscopic description of quantum mechanical systems 
following the dynamical equations of motion is still in some tension with the picture 
























1 INTRODUCTION 


4 


arising from the ensemble description of quantum statistical mechanics. That is to say, 
the questions of equilibration and thermalisation in what we will call pure-state quantum 
statistical mechanics remained largely unresolved until very recently. 

These foundational questions came back with a vengeance not too long ago. Old 
puzzles and new questions of quantum many-body systems out of equilibrium re-entered 
the centre of attention and are again much in the focus of present-day research in 
quantum many-body theory. This remarkable renaissance is primarily due to three 
concomitant factors in physics research. 

The hrst and arguably most important source of inspiration has been an 
experimental revolution. Fueled by enormous improvements in experimental techniques 
it became feasible to control quantum systems with many degrees of freedom. An 
entirely new arena for the study of physics of interacting quantum many-body systems 
emerged. This is particularly true for the development of techniques to cool and trap 
ultra-cold atoms and to subject them to optical lattices Is-lOl or suitable conhnements, 
giving rise to low-dimensional continuous systems lll-ll6|. Similarly, systems of 
trapped ions IT], [iSj, as well as hybrid systems Il9|, allow to precisely study the 
physics of interacting systems in the laboratory |2nl-l25|. In such hi ghly controlled 
settings, equilibration and thermalisation dynamics has been studied 26l-l30|. Especially 
setups with optical lattices allow for the realisation of condensed-matter-like interacting 
many-body systems in the laboratory, but with hne grained control over the model 
parameters and geometries. Questions concerning the out of equilibrium dynamics of 
such systems were suddenly not only important out of academic curiosity, but became 
pragmatically motivated questions important to understand experimentally realisable 
physical situations. 

The second major development is the broad availability of new machines: 
supercomputers. With the vastly increased computing power and massive parallelisation 
as well as novel numerical techniques such as tensor network methods 


311 and the 


density-matrix renormalisation group method 3^, it has become possible to simulate 
the dynamics of large quantum systems for relatively long times. Methods for exact 


diagonalisation have been brought to new levels |33H35|. complemented by quantum 
Monte Carlo techniques 36, 1^. and applications of dynamical mean held theory 
and density functional theorv|^. Idol . There is an enormous body of numerical works 


41 


33 

34 


dynamics of quantum phase transitions 
i.e., rapid changes of the Hamiltonian 
complemented by partly or entirely analytical studies that capture these and related 


. often with a focus on so-called quenches, 
This body of numerical work is 


50-56 


57-66 . We 


phenomena in concrete systems or classes of models (often integrable ones) 
will discuss these works in more detail later. 

Last but not least, our understanding of quantum mechanics has improved 
signihcantly since the time of von Neumann. The availability of new mathematical 
methods — in part motivated by research in quantum information theory — is the third 
driving force. These techniques have made some of the old questions become more 
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tractable than before, while at the same time new paradigms of approaching the key 


qnestions 
states 67 


rave emerged. This lead to works inspired by notions of typicality and random 


7l|. Also, notions of qnantnm information propagation, snch as Lieb-Robinson 


bounds 72l-l75|. and research on entanglement in many-body systems 2ll. I76l-l80| can be 
classihed as contribnting to this development. 

All in all, this is already too large a topic to cover in fnll depth in a single 
review of reasonable length. Hence, in this article we address and cover only a 
snbset of these developments and questions. We will mostly concentrate on the 
theoretical and analytical insights, however always making an effort to put them into the 
context of evidence collected through numerical simulations and important experimental 
developments. 

In physics, one can often say a “lot about little”, or “little about a lot”. In this 
review, we take the latter approach, by sticking to general and conceptual statements 
on interacting many-body systems in a quantum-information inspired rigorous language, 
so where only relatively “little” can be said. These statements, however, apply to “a lot”, 
that is, to an immense variety of models. 

At the heart of the approach advocated here lies the attempt to use only 
standard quantum mechanics and no additional postulates to explain the emergence of 
thermodynamic behaviour, and to do this in a mathematically rigorous and general way. 
It is an invitation to elaborate how much of statistical mechanics and thermodynamics 
can be derived from quantum mechanics. The term derived’’' here means to justify the 
well established methods and postulates of equilibrium and non-equilibrium statistical 
mechanics by means of the microscopic picture provided by quantum mechanics. 


671 . I81l |. we shall call this approach pure state quantum statistical 


Following Refs. 
mechanics. 

The level of detail and rigour that we are aiming at in this work necessarily also 
mean that an awkwardly large number of interesting questions and research results will 
have to be left unmentioned. In this sense, this article is not meant to be a comprehensive 
review. 

(i) We have authored together with Mathis Friesdorf a complementing accompanying 


review 


82| in Nature Physics that takes a much more physical perspective, where 


local interacting many-body systems out of equilibrium are in the focus of attention 
and experimental developments are more comprehensively discussed. 

A lot what is left out here is covered there. In this article, in contrast, we advocate a 
more mathematical mindset and language, and at the same time have a more limited 
scope, but the covered topics are discussed in more depth. To complete the picture of 
the subject, and in addition to Ref. j^, we recommend a number of further review 
articles and books that cover what we do not have the space to cover here: 


The book by Gemmer, Michel, and Mahler 69| entitled “Quantum thermodynam¬ 


ics” advertises an approach towards the foundations of thermodynamics that is in 
spirit close to the approach of this work. The focus is, however, more on notions 
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of typicality, which we will discuss in Section El but which is not a central topic of 
the present review. Moreover, the hrst edition of the book is from 2004, and even 
though it has been extended in the second edition from 2009, much of the newer 
material that takes the centre stage in this work is not covered. 


(iii) The editorial of a New Journal of Physics focus issue on the “Dynamics and 
thermalisation in isolated quantum many-body systems” by Cazalilla and Rigol 


83| not only explains the signihcance of the individual articles published in the 


focus issue to the more general endeavour of developing a better understanding of 
the coherent dynamics of quantum many-body systems. On top of that it gives an 
overview of many of the currently pursued research directions and many additional 
references. This renders this editorial a good entry point into the more recent 
literature on the subject and makes it an excellent read. At the same time, it 
provides only very little background information, almost no historical context, and 
assumes that the reader is already familiar with the jargon of the held. 


(iv) A colloquium in Reviews in Modern Physics by Polkovnikov, Sengupta, Silva, 
and Vengalattore 8^ is entitled “Non-equilibrium dynamics of closed interacting 
quantum systems”. This work gives an excellent overview of recent theoretical and 
experimental insights concerning such systems, but focuses mainly on the dynamics 
following so-called quenches, i.e., rapid changes in the Hamiltonian of a system 
and the eigenstate thermalisation hypothesis (ETH). We will discuss the ETH in 
Section 17.21 but the scope of the present work is considerably broader and we will 
also take a slightly different, quantum information theory inspired, point of view 
and put the focus more on analytical results. 


v) A review entitled “Equilibration and thermalisation in hnite quantum systems” by 


VI 


Yukalov ^ contains a review of the history of both the experimental realisation 
of coherently evolving, well controlled quantum systems and the observation and 
numerical investigation of equilibration and thermalisation in such systems. In 
addition it contains results on equilibration in closed systems with a continuous 
density of states and in systems undergoing so-called non-destructive measurements. 
The review Ref. 8^ on the thermodynamics of stochastic processes. It covers 


important topics such as fluctuation(-dissipation) theorems, entropy production, 
and (autonomous) thermal machines, which have been extensively studied in recent 
years and which are related to but not elaborated on in this work. 


(vii) Finally, the review “The role of quantum information in thermodynamics” [8^ 
overviews recent developments in the interplay between the helds of quantum 
information and thermodynamics. It focuses on foundations of statistical mechanics 
and on resource-theoretic aspects of thermodynamics. More explicitly, it covers 
equilibration and thermalisation, state transformation under different constraints 
and resources, work extraction, the work cost of information-processing tasks, 
inconvertibility of energy and correlations, and fluctuation relations. 


These articles and books together, in conjunction with the present review, rather 
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accurately cover the state of affairs. It is the purpose of this article to hll the gap left 
by the above mentioned works. 

2. Preliminaries and notation 

In order to facilitate the discussion in later chapters, we carefully introduce the notation 
and introduce a number of fundamental concepts in this section. The presentation 
is limited to the minimum necessary to make the following statements well-dehned. 
An effort has been made to make this introduction self-contained. However, a basic 
knowledge of quantum theory, analysis, linear algebra, group theory and related subjects 
is assumed. 

To begin with, we £x some general notation. Given a positive integer n G TA we 
use the short hand notation [n] := (1,... ,n) for the (ordered) range of numbers from 

1 to n and set [cxd] := Z"*". Given a set X we denote its cardinality by |X|. If X has a 

universal superset V D X, we write X'^ := V \ X for its complement. Given two sets 
X, Y we write X UY and X OY for their union and intersection. To stress that a set V 
is the union of two disjoint sets X, Y, i.e., X OY = 0 we write V = X UY. Given a set 
X of sets we write UX := IJxex^ union of the sets in X. For sequences S, l^l 

denotes the length of the sequence. When we define sets or sequences in terms of their 
elements we use curly { ■ } or round (■) brackets respectively. 

We use the (Bachmann-)Landau symbols O, hi and 0 to denote asymptotic growth 
rates of real functions f, g: M. ^ M.. In particular 

f{x) G 0(fl'(a;)) limsup |/(a;)/ 5 ((a;)| < cx), (1) 

x—yoo 

and for hi we adopt the convention from complexity theory that 

f{x) G Ol{g{x)) g{x) G 0{f{x)) (2) 

and write f{x) G Q{g{x)) if both f{x) G 0{g{x)) and f{x) G r2(5f(a;)). To simplify 
the notation we work with natural, or Planck units such that in particular the Planck 
constant h and the Boltzmann constant ks are equal to 1. 

Let "H be a separable Hilbert space over C with inner product {(plip) for \ip), \'ip) G Pi. 
We denote by B{Pi) be the space of bounded operators and by T{Pi) the space of trace 
class operators on the Hilbert space Pi, i.e., those A G B{Pi) whose trace Tr A is finite. 
The trace class operators p G P{Pi), whose associated linear functional Tr(p ■) is non¬ 
negative, i.e., VA > 0: Tr(pA) > 0 and which have unit trace Trp = 1, form the convex 
set S{Pi) of (quantum) states or density operators. An operator A G B{'H) is self-adjoint 
if A = Ah An operator H G B{'H) is a projector if HH = H. The rank of an operator 
A G B{Pi), denoted by rank(A), is the dimension of its image. An operator U G B{'H) 
is called unitary ii W U = UW = 1. It turns out that in the finite dimensional setting 
considered here S{Pi) C 0{Pi) is the convex set of self-adjoint, non-negative operators 
with unit trace. The extreme point of that set are rank one projectors and are called 
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pure states. The elements of the subspace 0{'H) C B{'H) of self-adjoint operators are 
called observables. 

Given a bounded operator A G B{'H) and a state p 6 SiTL), we will write the 
expectation value of A in state p is as 


{A), ■.= Tt{Ap). 


( 3 ) 


The most general measurements possible in quantum mechanics are so-called 
positive operator valued measurements (POVMs) 87]. A POVM with K measurement 
outcomes is a sequence M = of operators G B{'H), called POVM elements, 

with the property that 


K 

E 

k=l 


Mfc = 1. 


( 4 ) 


Upon measuring a system in state p G S{'H) with the POVM M, outcome number k 
is obtained with probability Tr(Mfcp). When we say that an observable A G 0{'H), 
with spectral decomposition A = is measured, we mean that the POVM 

M = (Iffc)^^^ is measured and the measurement device outputs the value when 
outcome k is obtained. The average value output by the device in measurements of 
identically prepared systems is then indeed given by Eq. (IHl). A measurement of a 
POVM where all the POVM elements are projectors is called a projective measurement. 
The measurement statistic of a POVM in a state p is the vector of probabilities Tr(Mfc p). 

The most general (quantum) operations in quantum mechanics are capture d by 
so-called completely positive trace preserving maps, also-called quantum channels 87] . 
We call maps B{TL) — B{Ti) superoperators. We denote the identity superoperator by 
id: B{'H) — B{Ti). A linear map G: 0{Ti) —)■ 0{Ti) is then called completely positive 
trace preserving if for all separable Hilbert spaces Pi' it holds that 


\/peS{H®H'): {C®iA)peS{n®H'). 


( 5 ) 


In the hnite dimensional setting considered here, it turns out that hxing Pi' = PL i\i 
Eq. ([5]) already gives a necessary and sufficient condition for a map G: 0{Pi) ^ 0{Pi) 
to be completely positive trace preserving j^. We denote the set of all completely 
positive trace preserving maps on S{Pi) by T^{Pi). 

Throughout most of this review we will work in the framework of hnite dimensional 
quantum mechanics. That is, if not explicitly stated otherwise, we consider systems 
that are described by a Hilbert space Pi over C whose dimension d := dim('H) is hnite, 
bosonic systems constituting an important exception. 

For every 1 < p < cxo the Schatten p-norm of an operator A G B{Pi) is dehned as 

881 




( 6 ) 


-i=i 

where is the ordered, i.e., Si(A) > • • • > Srf(A), sequence of non-negative, real 

singular values of A. We refer to the Schatten cxD-norm as the operator norm and call 
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the Schatten 1-norm trace norm The Schatten j9-norms are ordered in the sense that 

881 


WAeBin): IIAll < 


p' 


p> p 


and in the converse direction the following inequalities hold 

II ■ 111 <Vd\\ ■ II 2 < c^ll ■ IL- 


88 


(7) 


( 8 ) 


For quantum states a natural and frequently used distance measure is the trace 


distance 


87 


Vp, a e S{V.): 


I’{p,a) := - ||p-cr|li 


(9) 

It is, up to the factor of 1/2, the metric induced by the trace norm || • ||j^. Its relevance 
stems from the fact that it is equal to the maximal difference between the expectation 
values of all normalised observables in the states p and a, i.e., 871 

V{p,a) = maxAeoCK): o<A<i Tr(Ap) -Tr(Acr). (10) 

The trace distance is non-increasing under completely positive trace preserving maps 
C G T^iTL), i.e., V{C{p),C{a)) < V{p,a) and invariant under unitary operations, i.e., 
D{UpU^),U crt/l) = D{p,a). Moreover, if one is given an unknown quantum system 
and is promised that with probability 1/2 it is either in state p or state a, then the 
maximal achievable probability Pmax for correctly identifying the state after a single 
measurement of the optimal observable from Eq. (ITOl) is given by 

l + V{p,a) 


89 


90 


Pmax — 


( 11 ) 


Inspired by this, one can define the distinguishability of two quantum states under 
a restricted set AA of POVMs. The optimal success probability for single shot state 
discrimination is then again given by an expression of the form ffTTj) . but with D{p,a) 
replaced by 


91 


sup ^ V] 1 Tr(Mfc p) - Tr (Mfc a) , 

MeM2 

(12) 

'8>m {P, cr) < 'P {P, cr) . 

(13) 


and it holds that 

with equality for all PiU & S{'H) if and only if Ad is a dense subset of the set of all 
POVMs (9l|. It is worth noting that •) is a pseudometric on 8(71), i.e., it is 

a symmetric, positive semidefinite bilinear form, but Dj^{p,a) = 0 p = a. For 


90 


further properties of the distinguishability Dm see for example Ref. 

Another frequently emplowd distance measure is the fidelity, defined for any two 
quantum states p, cr G S{71) aqil 

,1/2^ 2 


D (p, a) := Tr (p^/^ ^ ^ 1 / 2 ^ V2 j 


( 14 ) 


f Some authors define the fidelity as the square root of the T used here. 
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Similar to the trace distance, the fidelity is symmetric, i.e., IF{p,(j) = p), non¬ 

decreasing nnder completely positive maps, and invariant nnder nnitary operations. 
The fidelity is not a metric, bnt it is related to the trace distance via 1 — < 

P(p, cr) < (1 — J^(p, cr))^/^. For pnre states xjj = IV’XV’I and p = \p){^\ it rednces to the 
sqnare of their overlap IFiplj^ip) = |('0|<p)p. 

The (time independent) Hamiltonian H G 0{'H) of a finite dimensional qnantnm 
system has the spectral decomposition 

d’ 

H = Y,EkT^k (15) 

k=l 

where the 11 ^ G 0{'H) are its orthogonal (and mntnally orthogonal) spectral projectors 
and d' := |spec(Ff)| < d = dim('H) is the nnmber of distinct, ordered (energy) 
eigenvalues Ek E M. of if, i.e., k < I Ek < Ei. The snbspaces on which the 

Ilfc project are called (energy) eigenspaces or energy levels. If if is non-degenerate it 
holds that 11 ^ = \Ek){Ek\ with {\Ek))'l^i a seqnence of orthonormal energy eigenstates 
of if and d := dim('H) the dimension of Ti. 

The Hamiltonian if governs the time evolution p: R —)■ S{'H) of the state of a 
qnantnm system via the (Schrodinger-)von-Neumann-eguation, which in the Schrodinger 
pictnre reads 

^Pit) =-i[H,p{t)]. (16) 

Its formal solntion can be given in terms of the time evolution operator^ which in the 
case of time independent Hamiltonian dynamics is given by the operator exponential 

ViGR: U{t):=e-''^^ (17) 

The time evolved qnantnm state at time t is then 

p{t):^U>{t)p(.0)U(t), ( 18 ) 

with p(0) the initial state at time i = 0. 

The temporal evolntion of the expectation valne of an observable A G 0{'H) then 
solves 

(^)pl,) =Tr(At/t(i)p(0)£;(i)) =Tr(t;(«)AC/t(i)p(0)). (19) 

One can thns eqnally well-define the time evolntion of an observable H: R —)• 0{'H), 

with the initial valne A(0) given by the operator A from Eq. flT^ . by setting A{t) := 
U{t) H(0) Hl(f), and consider a fixed qnantnm state p G S(H), eqnal to the initial state 
p(0) in Eq. (IT^ . Then {A{t))p is eqnal to {A)p{p from Eq. (IT^ for all f G R. The 
time evolntion H: R ^ 0{'H) of an observable in the Heisenberg pictnre solves the 
differential eqnation 

^A(t)^ilH,A(t)]. (20) 

We call all observables A G 0{'H) that commnte with the Hamiltonian, i.e., for 
which [ff, A\ \= AB — B A = conserved guantities. It follows directly from Eq. fl2UI) 
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that the expectation value of all conserved quantities is independent of time, irrespective 
of the initial state, which justifies the name. If the Hamiltonian H is non-degenerate, 
then exactly the observables that are diagonal in the same basis as H are conserved 
quantities. In the presence of degeneracies exactly the observables A G 0{'H) for which 
some basis exists in which both A and H are diagonal are conserved quantities. 

Given a function / depending on time, we define its finite time average 

-T 1 r 

f ■■=7^ I fit), (21) 

and its (infinite) time average 

7 lim f, (22) 

T^oo 

whenever the limit exists. In all cases we will be interested in, the existence of the limit 
in Eq. (|22|) is guaranteed by the theory of (Besicovitch) almost-periodic functions 92 . 

In particular we will encounter the time averaged state cu := p, which is, in the finite 
dimensional case considered here, equal to the initial state p(0) dephased with respect 
to the Hamiltonian H, i.e., u = $h(p(0)), with the de-phasing map acting as 

d' 

p $h(p) := ^ Hfc pHfc (23) 

k=l 

and the sequence of orthogonal spectral projectors of H. 

We will encounter systems consisting of smaller subsystems. Often their 
Hamiltonian can be written as a sum of Hamiltonians that each act non-trivially only 
on certain subsets of the whole system. We will refer to such systems as composite 
(guantum) systems or as locally interacting (guantum) systems, depending on whether 
we want to stress that they consist of multiple parts or that the interaction between the 
parts has a special structure. The notion of locally interacting quantum systems can 
be formalised by means of an interaction (hyper)graph Q := {V,S), which is a pair of a 
vertex set V and an edge set 8. 

The vertex set V is the set of indices labeling the sites of the system and we will 
work under the assumption that |V| < cx). The Hilbert space TL of such a system is 
either, in the case of spin systems, the tensor product Hilbert spaces 

^{x] of the individual sites x G V, or, in the case of fermionic or bosonic systems, the 
Eock space, or a subspace of the latter. 

We will encounter bosons, which usually need to be described using infinite 
dimensional Hilbert spaces, only in Section [T3l hence we want to avoid the technicalities 
of a proper treatment of inhnite dimensional Hilbert spaces and unbounded operators in 
the framework of functional analysis. We will thus only introduce the minimal notation 
necessary to formulate the statements we will discuss in Section 13.31 

The sites x G V of fermionic and bosonic composite systems are often called modes. 
In the case of fermions each mode is equipped with the Hilbert space with 

orthonormal basis ((|n)/))i= 0 ! bosons with the Hilbert space = tfi 
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of square summable sequences with orthonormal basis (|u)f,)^g. For composite systems 
with exactly N fermions or bosons in M modes, i.e., V = [M], the Hilbert space is given 
by a so-called Eock layer. The Fock layer to particle number N is the complex span of 
the orthonormal Eock (basis) states |ui,... ^hm) f or |ui,... ,nM)b respectively, where 
for each x E V, rix is the number of particles in mode x and thus ~ ^ with 

Tlx £ {0,1} in the case of fermions, and rix € [N] in the case of bosons. 

The full Eock space of a system of fermions or bosons is the Hilbert space completion 
of the direct sum of the Fock layers for each possible total particle number. For fermions 
it holds that N < M due to the Pauli exclusion principle, and the resulting Hilbert space 
is hence finite dimensional. In the case of bosons N is independent of M and the Fock 
space is thus inhnite dimensional already for a finite number of modes. 

We define the fermionic and bosonic annihilation operators fx and bx on site x and 
the corresponding creation operators f) and (collectively often referred to as simply 
the fermionic/bosonic operators) via their action on the Fock basis states given by 

fx\ni,.. .,nM)f = .. .,nxi,nx - l,nx+i ,...)/, ( 24 ) 

f)\ni,.. .,nM)f = (1 - nx){-l)^y=^''^\ ...,nxi,nx + l,nx+i ,...)/ ( 25 ) 

and 


• • • 7 \/TI^ iTil , • • • ; ^x 

I 7 • • • 7 

( 26 ) 

bl\ni,..., nM)b = y/nx + l|ui, ...,nx^, 

Tlx “1“ I? ^x+1; • • • 5 ^M^b' 

( 27 ) 

They satisfy the (anti) commutation relations 

{fxjy} = {flfy} = 0, 

{fx, /J} = bx,y, 

( 28 ) 

[bx,by] = [bi,bl] = 0, 

[bx, by] = bx^y. 

( 29 ) 


where for any two operators A,Be B{'H) [^,5] '.= AB — B A is the commutator and 
{A, B} := A B + B A the anti-commutator. We say that A, B commute or anti-commute 
if [A, H] = 0 or {A, B} = 0 respectively. 

Any operator that commutes with the total particle number operator 'YhxEV fx fx 
or Ylxev ^x ^x respectively is called particle number preserving. In systems with particle 
number preserving Hamiltonians a constraint on the particle number can be used to 
make the description of bosonic systems with hnite dimensional Hilbert spaces possible. 
The Hilbert space is then a finite direct sum of Fock layers. We say that a state has a 
finite particle number if it is completely contained in such a finite direct sum of Fock 
layers. 

In systems of fermions, all operators can be written as polynomials of the fermionic 
operators. A polynomial of fermionic operators is called even/odd if it can be written as 
a linear combination of monomials that are each a product of an even/odd number 
of creation and annihilation operators. According to the fermion number parity 
supers election rule j^, only observables that are even polynomials in the fermionic 
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operators can occur in nature. The same holds for the Hamiltonians and density matrices 
of such systems. Consequently, whenever we make statements about systems of fermions 
we assume that all observables, states and the Hamiltonian are even. 

We refer to subsets of the vertex set V as subsystems. Generalising the notation 
introduced for the Hilbert spaces of the individual sites we denote the Hilbert spaces 
associated with a subsystem X C V by Hx and its dimension by dx '■= dim(?{x)- In 
the case of composite systems of fermions or bosons it is understood that if an upper 
bound on the total number of particles has been imposed, then TLx is taken to be the 
direct sum of Fock layers corresponding to the sites in X up to the total number of 
particles. The size of a (sub)system X C V is given by the number of sites or modes 
|X|, not the dimension of the corresponding Hilbert space. 

For spin systems we define the support supp(H) of an operator A G H('H) as the 
smallest subset of V such that A acts like the identity outside of X. For systems of 
fermions or bosons we define the support of an operator via its representation as a 
polynomial in the respective creation and annihilation operators. The support is then 
the set of all site indices x G V for which the polynomial contains a fermionic or bosonic 
operator acting on site x, e.g., fej, or f^. The support of a POVMs is simply the union 
of the supports of its POVM elements. Similarly, we define the support supp(C') of a 
superoperator C: B{Ti) — B{Ti) as the smallest subset of V such that 

VH G B{TL)-. supp(H) C supp(C)'^ C{A) = A. (30) 

We say that an observable, POVM, or superoperator is local if the size of its support is 
small compared to and/or independent of the system size. 

In order to fully exploit the notion of a subsystem we need to understand how 
the description of a joint system fits together with the description of a subsystem 
as an isolated system, i.e., how systems can be combined and decomposed. For 
every subsystem X C V there is a canonical embedding of B{'Hx) into B{'H) that 
bijectively maps B{Tix) onto the subalgebra of bounded linear operators A G B{Ti) 
with supp(H) C X, and similarly for all operators that are polynomials of bosonic 
operators. In the case of spin systems the embedding is simply the natural embedding 
A G B{TLx) ^ 0 £ 'S('H), where Ix^^ denotes the identity operator on 'Hx<^- In 

systems of fermions or bosons we associate to each operator on TLx the operator on Ti 
that has the same representation as a polynomial in the fermionic/bosonic operators, 
but, of course, in terms of the fermionic/bosonic operators of the full system with Fock 
space Ti rather than the fermionic/bosonic operators that act on TLx- For systems of 
fermions, because of the phase in Eq. (|2T|) that depends non-locally on the state, this 
embedding depends on the exact position the sites in X have in the vertex set V. The 
vertex set should hence rather be called vertex sequence, but for even operators the 
phases cancel out, which is why we ignore this subtlety. 

Conversely, for any A G B{'H) and any subsystem X C V: X D supp(H) that 
contains supp(H) we define the truncation A^x ^ B{Tix) of A as the operator that 
acts on the sites/modes in the subsystem X “in the same way” as A, in the sense that 
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a truncation followed by a canonical embedding gives back the original operator. In 
particular, for spin systems any A G B{T-i) is of the form A = Afsupp(yi) ® Isupp(yi)''- For 
general systems, the identity operator 1 of course satisfies \x = for any X C V. 

We now turn to the edge set. The edge set 8 is the set of all subsystems X C V 
for which a non-trivial Hamiltonian term Hx with supp{Hx) = X exists that couples 
the sites in X. The Hamiltonian of a locally interacting quantum system with edge set 
8 — often just called a local Hamiltonian — is of the form 

H=J2Hx, (31) 

x&e 

with supp(ifj!s:) = X for all X G T. Most Hamiltonians in the condensed-matter context 
or of cold atoms in optical lattices can be very well approximated by such locally 
interacting Hamiltonians. Generalising this notation to subsystems X C V that are 
not in 8 we define for any subsystem X C V the restricted Hamiltonian 

Hx := ify G 0{n), (32) 

Y£S-. vex 

which obviously fulfils supp(Trx) ^ X. Note that we adopt the convention that Hx is 
an element of 0{TL) and not of 0{TLx)- 

We will also need the graph distance. In order to define it, we first need to give a 
precise meaning to a couple of intuitive terms: We say that two subsystems X, H C V 
overlap if X fl X 7 ^ 0, a set X C V and a set F C T overlap if F contains an edge that 
overlaps with X, and two sets F,F' <Z 8 overlap if F overlaps with any of the edges in 
F'. A subset F <Z 8 oi the edge set connects X and X if F contains all elements of some 
sequence of pairwise overlapping edges such that the first overlaps with X and the last 
overlaps with X and similarly for sites x,y eV. 

The (graph) distance d(X, X) of two subsets X, X C V with respect to the 
(hyper)graph (V, T) is zero if X and X overlap and otherwise equal to the size of the 
smallest subset of 8 that connects X and X. The diameter of a set F C T is the largest 
graph distance between any two sets X, X G F. We extend the definition of the graph 
distance to operators A,Be B{'H) and set d{A,B) := d(supp(A), supp(F)). 

We will also make use of the notion of reduced states, or marginals. Given a quantum 
state p G S{H) of a composite system with subsystem X C V we write for the reduced 
state on X, which is defined as the unique quantum state p^ G S{Tix) with the property 
that for any observable A G 0{'H) with supp(y4) C X 

Tr(ArxP^) = Tr(Ap). (33) 

Defining the reduced state in systems of fermions in this way is important to avoid 
ambiguities j^. We will denote the linear map p 1 —)■ by Tr^^. As Trx= is linear we 
can naturally extend its domain to all of B{'H) so that 


Trxc: B{H)^B{Hx). 


(34) 
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In the case of spin systems Tr^c is indeed the partial trace over = V\X as dehned for 
example in Ref. 87|. For time evolntions p: R —iS('H) we use the natural generalisation 
of the superscript notation, i.e., = Tr^^ op: R —)■ S{TLx)- 

Correlations play a central role in the description of composite systems and hence in 
condensed matter physics and statistical mechanics. It is beyond the scope of this work 
to give a comprehensive overview of the different types and measures of correlations (see 


for example Refs. 871.195l-l97|). One important measure of correlation is the covariance, 


which for a quantum state p G S{'H) and two operators A, B & B{'H) is dehned to be 


covp{A,B) := Tt{pAB) — Tr(py4) Tr(pi?). 


(35) 


It satishes 

cov,(AB)I< ((5l\{S\)‘''" (36) 

and hence one often dehnes the correlation coefficient as covp{A, B)/{{A^)p {B'^) 

We will encounter a slightly generalised version of the covariance in Section [101 

The covariance is most interesting as a correlation measure if A and B act on disjoint 
subsystems, i.e., supp(y4) O supp(R) = 0. If for a given state p G Sill) of a bipartite 
system with V = X ilY and any two observables A,B & 0{'H) with supp(y4) C X and 
supp(i?) C y it holds that coVp(R, R) = 0, then we say that p is uncorrelated with 
respect to the bipartition V = X ilY. 

Uncorrelated states of spin systems are product states. Consider a bipartite spin 
system with Hilbert space H and vertex set V = X (JY. A quantum state p G S{'H) is 
said to be product with respect to this bipartition if p = p^ ( 8 ) p^. We call a basis that 
consists entirely of product states a product basis. 

Still in the setting of a bipartite spin system with Hilbert space H and vertex set 
V = X U y, all quantum states of the form 

P = '^Pjpf®p] (37) 

3 

with {pj)j a probability vector, i.e., 'YljPj — 1 pj > 0 for all j, and pf G S{Tix) and 
pj G S{'Hy) for all j, are called separable with respect to the bipartition V = Xljy. All 
states that can be prepared with local operations and classical communication (LOCC) 
are called separable, a notion that also holds true for bosonic or fermionic systems. Such 
states are correlated in general, but a classical mechanism can be held responsible for 
the correlations present. All states that are not separable are called entangled. 

The Gibbs state or thermal state of a system with Hilbert space TL and Hamiltonian 
H G 0{'H) at inverse temperature /9 G R is dehned as 

9|ffl(/3)-A^e5(W), (38) 

where Z[H] is the (canonical) partition function dehned as 

Z[H]{Y) ■■= Tr(e-^^). 


(39) 
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The Gibbs state has the important property that it is the unique quantum state that 
maximises the von Neumann entropy 


S{p) ■■= -Tr(plog2p). 


(40) 


given the expectation value of the Hamiltonian j^. This is a direct consequence of 
881 and the fact that the same statement holds in classical statistical 


Schur’s lemma 

mechanics, as can be seen from a straight forward application of the Lagrange multiplier 
technique. In fact, the inverse temperature fl is nothing but the Lagrange parameter 
associated with the energy expectation value. 

For locally interacting quantum systems with a Hamiltonian H G 0{'H) of the form 
given in (l3T]) we adopt the convention that for any subsystem X G V 


= Trv.(9|^/l(/3)) e S{Hx) 


(41) 


denotes the reduction of the Gibbs state of the full system to the subsystem X (compare 
Eq. 

The micro-canonical ensemble in quantum statistical mechanics takes the form of 
the micro-canonical state. Usually one defines the micro-canonical ensemble and state 
with respect to an energy interval [E, E A], Here we make the slightly more general 
definition that will be useful later: The micro-canonical state to any subset i? C R of 
the real numbers of a system with Hilbert space TL and Hamiltonian H G 0{'H) with 
spectral decomposition H = EkAk is defined as 

n|g|(fl) := e S(-H), (42) 

where Zmc[H] is the micro-canonical partition function defined as 

Z^,[H]{R) :=Tr( H^). (43) 


3. Equilibration 


The dynamics of finit e dimensional quantum system, as described in the previous section. 


99l4l03| and time rev ersal invariant. Hence, genuine equilibration in the 


is recurrent ^ _ 

sense of Boltzmann’s H-Theorem 104| that implies that entropy can only grow over 
time (see also Section lA.l.lD is impossible. This apparent contradiction between the 
microscopic theory of quantum mechanics and the thermodynamic behaviour observed 
in nature is one of the main issues that any derivation of statistical mechanics and 
thermodynamics from quantum theory needs to solve. 

We will see in this section that the unitary time evolution of pure states of such 
systems does imply in a surprisingly general and natural way that certain time dependent 
properties of quantum systems do dynamically equilibrate and that hence this apparent 
contradiction can be resolved to a large extend. 
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We will concentrate on two notions of eqnilibration: equilibration on average and 
equilibration during intervals. After an introduction of these two notions in Section 13.11 
we will discuss them in detail in Sections [32] and [331 In particular we will give conditions 
under which equilibration in the respective sense can be ensured. In Section 13.41 we 
touch upon other notions of equilibration that have been investigated in the literature. 
Then we discuss Lieb-Robinson bounds, which limit the signal propagation in locally 
interacting quantum lattice systems, in Section 13.51 before we go on to survey results 
on the times scale on which equilibration happens in Section 13.61 We end this section 
with a brief description of fidelity decay in Section 13.71 In the next section. Section [T] 
we then put the discussed rigorous results into the perspective of the picture emerging 
from numerical simulations and the insights gained from analytic investigations of more 
specihc models. 

3.1. Notions of equilibration 

In this section we dehne and compare two notions of equilibration compatible with 
the recurrent and time reversal invariant nature of unitary quantum dynamics in hnite 
dimensional systems. These notions will capture the intuition that equilibration means 
that a quantity, after having been initialised at a non-equilibrium value, evolves towards 
some value and then stays close to it for an extended amount of time. At the same 
time, what we will call equilibration is less than what one usually associates with the 
evolution towards thermal equilibrium. We will define a quantum version of the latter, 
call it thermalisation, and discuss it in detail in Section [7] 

To keep the dehnition of equilibration as general as possible we will refer abstractly 
to time dependent properties of quantum systems, by which we mean functions /: M —)■ 
M that map time to some metric space M, for example M or S{'H). The metric will 
allow us to quantify how close the value of such functions is for different times and in 
particular how close it is to the time average and “equilibrium values” of the function. 

Properties that we will be interested in include for example the time evolution 
of expectation values of individual observables. We will also encounter subsystem 
equilibration. In this case the property is the time evolution of the state of the 
subsystem and the metric the trace distance. It will also be convenient to speak more 
generally of the apparent equilibration of the whole system with the metric then being 
the distinguishability under a restricted set of POVMs. 

We will discuss the following two notions of equilibration in more detail; 

Equilibration on average: We say that a time dependent property equilibrates on 
average if its value is for most times during the evolution close to some equilibrium 
value. 

Equilibration during intervals: We say that a time dependent property equilibrates 
during a (time) interval if its value is close to some equilibrium value for all times 
in that interval. 
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The use of the notion of equilibration on average in the quantum setting goes back 


to at least the work of v on Neuni ann |3| and has recently been developed further, in 
particular in Refs, We will see that equilibration on average, especially 

for expectation values of observables as well as for reduced states of small subsystems 
of large quantum systems, is provably a very generic feature. In contrast, equilibration 
during intervals is a property that is expected to be generically the case for locally 
interacting many-body systems, and there is compelling numerical evidence for such a 
behaviour. To date, however, it has rigorously been proven only for specihc models 


llll.lll2|. 


Equilibration on average implies that the equilibrating property spends most of the 
time during the evolution close to its time average. This allows for a reasonable dehnition 
of an equilibrium state, which is then the time averaged or de-phased state. As we will 
see later in Section [71 this makes it possible to tackle the question of thermalisation in 
unitarily evolving quantum systems. 

On the down side, a proof of equilibration on average alone does not immediately 
imply much about the time scale on which the equilibrium value is reached after a 
system is started in an out of equilibrium situation. We will see that even though it 
is possible to bound these time scales, the bounds obtainable in the general settings 
considered here are only of very limited physical relevance (see Section 13.6p . 

As we will see in the following, the statements on equilibration during intervals 
are much more powerful in this respect. They imply bounds on the time it takes 
to equilibrate that scale reasonably with the size of the system and hence prove 
equilibration on experimentally relevant time scales. On the other hand, in the few 
settings in which equilibration during intervals of reduced states of subsystems has been 
proven, it is known that the equilibrium states are not close to thermal states of suitably 
restricted Hamiltonians. In particulate, no proof of thermalisation (in the sense of the 
word we will dehned later in Section 17.11) based on a result on equilibration during 
intervals is known to date. We discuss both notions of equilibration in detail in the 
following two sections. 


3.2. Equilibration on average 

In this section we discuss equilibration on average (see hgure [2] for a graphical 
illustration). The outline is as follows: After giving some historic perspective we will 
go through the main ingredients that feature in the known results on equilibration on 
average and discuss their role in the arguments and to what extent they are physically 
reasonable and mathematically necessary. After this preparation we will state, prove 
and interpret the arguably strongest result on equilibration on average known to date. 

Already the founding fathers of quantum mechanics realised that the unitary 
evolution of large, closed quantum systems, together with the immensely high dimension 
of their Hilbert space and quantum mechanical uncertainty, could possibly explain the 
phenomenon of equilibration. Most notable is an article of von Neumann from 
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1929, which already contains a lot of the ideas and even variants of some of the resnlts 
that can be found in the modern literature on the subject. The renewed interest in 
the topic of equilibrati on was to a large extent a consequence of the two independent 
theoretical works Refs. 1061. Il07| . The approach outlined there was then more recently 
rehned and t he re sults gradually strengthened. Important contributions are in parti cula r 
Refs. j^. 1^ Il09| . Also very noteworthy is the often overlooked earlier work Ref. 1051. 

The hrst fact that plays a prominent role in the proofs of equilibration on average 
is the immensely high dimension of the Hilbert space of most many-body systems. 
The dimension of the Hilbert space of composite systems grows exponentially with the 
number of constituents. What actually matters, of course, is the number of significantly 
occupied energy levels, rather than the number of levels that are in principle available 
but not populated. For each k G [d'] we dehne the occupation pk '■= Tr(nfcp(0)) of 
the k-th energy le vel, wher e d' := |spec(iif)| < d = dim('H) is the number of distinct 
such levels. Refs. 1051 Il06| use maxfcPfc, the occupation of the m ost o ccupied level, to 
quantify the number of signihcantly occupied energy levels. Ref. uses a quantity 
called effective dimension, denoted by which in our notation can be dehned as 


d' 


eff, 


OJ) \= 


Et=iPl 


> 


(44) 


If the initial state is taken to be an energy eigenstate, the resulting effective dimension is 
one, while that resulting from a uniform coherent superposition of d energy eigenstates 


to different energies is d. This justihes the interpretation of as a measure of the 

number of signihcantly occupied states. It is also reciprocal to a quanti ty th at is known 
mostly in the condensed matter literature as i nvers e participation ratio Ci and related 
to the time average of the Loschmidt echo 66l . Ill4| . While using the effective dimension 
instead of the occupation of the most occupied level can lead to tighter bounds it has the 
disadvantage that it cannot be efficiently computed given a state and the Hamiltonian. 

There are a number of different ways to argue why it is acceptable to restrict 
oneself to initial states that populate a large number of energy levels when trying to 
prove the emergence of thermodynamic behaviour from the unitary dynamics of closed 
systems. First, one can argue that initial states that only occupy a small subspace 
of the Hilbert space of a large system behave essentially like small quantum systems 
and such systems are anyway not expected to behave thermodynamically, but rather 
show genuine quantum behaviour. Second, one can invoke the inevitable limits to the 
resolution and precision of experimental equipment to conclude that preparing states 
that overlap only with a handful of the roughly 2^°^^ energy levels of a macroscopic 
system is impossible, even if we had apparatuses that wer e many orders of magnitude 
more precise than the equipment available today 106, ESI- Finally, one can also take a 
more mathematical point of view and use results based on a phenomenon called measure 
concentration 1151 Ill6| that guarantees that uniformly random pure states drawn from 


sufficiently large subspaces of a Hilbert space have, with extremely high probability, an 
effective dimension with respect to any hxed, sufficiently non-degenerate Hamiltonian 
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Figure 1. (Reproduction from Ref. [l20|) Rlustration of the non-degenerate energy 
gaps condition. No gap between two energy levels may occur more than once in the 
spectrum, but the individual levels may well be degenerate. 


that is comparable to the dimension of that snbspace 68|, llOTI. IllTI. Ill8| (more on 
snch typicality argnments in Section [6]). If one is willing to assnme that snch states 
are physically natnral initial states, this can jnstify the assnmption of a large effective 
dimension. We will come back to this in Section [6] where we discnss typicality. For 
an earlier work that directly min gles typicality argnments and de-phasing to derive an 
eqnilibration resnlt see also Ref. (13. 

As we will see below, it is actnally snfficient for eqnilibration that max'^pfe, the 
second largest of the energy level occnpations, is small. Note that in the physically 
relevant sitnation of a system that is cooled close to its gronnd state max'^p^, can be 
orders of magnitnde smaller than max^ nt, or Althongh the proof of this 


extension of previons resnlts is not trivial 109j, the physical intnition behind it is clear: 
The expectation valnes of all observables of a system that is initialised in an energy 
eigenstate are already in eqnilibrinm. What can prevent eqnilibration on average are not 
macroscopic popnlations of one energy level, bnt rather initial states that are coherent 
snperpositions of a small nnmber of energy eigenstates. Snch states can show a behavionr 

reminiscent of Rabi-Oscillations and not exhibit eqnilibrat ion. _ 

The second main ingredient to the proofs of Refs. is the condition of 

non-degenerate energy gaps originally called the non-resonance condition. We say that 
a Hamiltonian H has non-degenerate energy gaps, if for every k, l,m,n & [d'] 


Ek — El — En 


En 


{k = lAm = n)\/{k = mAl = n), 


(45) 


i.e., if every energy gap Ek — Ei appears exactly o nce in the spectrnm of H (see £gnre[T]). 
The original condition used in Refs, j^, 105 - 107 1 is stronger and excludes in addition all 
Hamiltonians with degeneracies, i.e., requires that d' = d. Although the non-degenerate 
energy gaps condition appears to be pretty technical at hrst sight, the motivation for 
impo sing it can be made apparent by the following consideration; The main concern of 
Ref. 107| is the equilibration on average of the reduced state p^(t) of a small subsystem 


S' of a bipartite system with V = S U B. If the Hamiltonian of the composite system is 
of the form 

H = Hs + Hb, (46) 


i.e., S and B are not coupled (remember the dehnition of the restricted Hamiltonian 
in Eq. (I3^ ). then p^{t) will simply evolve unitarily and equilibration of p^{t) 
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is clearly impossible. Hence, one needs a condition that excludes such non¬ 
interacting Hamiltonians. Imposing the condition of non-degenerate energy gaps is 
a mathematically elegant, simple, and natural way to do this. It is easy to see 
that Hamiltonians of the form given in Eq. (H6|) have many degenerate gaps, as their 
eigenvalues are simply sums of the eigenvalues of Hs and Hb- 

In the more recent literature, the condition of non-degenerate energy gaps has been 


gradually weakened, 
interval of width e. 


Ref. 


9l| dehnes the maximal number of energy gaps in any energy 


N{e) := sup|{(fc,/) G [d’Y: k Y I A Ek - Ei E [E,E + e]}\. 


(47) 


Note that iV(0) is the number of degenerate energy gaps and a Hamiltonian H satishes 
the non-degenerate energy gaps condition if and only if A^(0) = 1. The above dehnition 
allows to prove an equilibration theorem that still works if a system has a small 
number of degenerate energy gaps. Moreover, it has the advantage that it allows to 
make statements about the equilibration time. As we will see in the next theorem, 
equilibration on average can be guaranteed to happen on a time scale T that is large 
enough such that T e 3> 1 where e must be chosen small enough such that N (e) is small 
compared to the number of signihcantly populated energy levels. 

The arguably strongest and most general result concerning equilibration on av erag e 


in quantum systems can be obtained by combining the two recent works Refs. |9lL 11091 


In fact, we will see that it even goes slightly beyond a mere proof of equilibration on 
average, as it does have non-trivial implications for the time scales on which equilibration 
happens. 

Theorem 1 (Equilibration on average). Given a system with Hilbert space Ti and 
Hamiltonian H G 0{Ti) with spectral decomposition H = Ylt=i Ek H^. Eor p(0) G iS('H) 
the initial state of the system, let u = $h{p{^)) be the de-phased state and define the 
energy level occupations pk '■= Tr(nfc /o(0)). Then, for every e,T > it holds that (i) for 
any operator A G B{Ti) 


mm - {Am < PIL N{e)fieT)gm)t=Y 


and (a) for every set AA of POVMs 


VM{p{t),uy <h{M) [^N{e)f{eT)g{{pQti 
where N{e) is defined in Eg. (IT7|) . f{eT) := 1 -|- 8 \og 2 {d')/{eT), 


1/2 


(48) 


(49) 


s((pt)iLi) := miii(^p|,3max>s), 


k=l 


and 


h{M) ■■= min(|U>l|/4,dim('Hsupp(;K))/2), 


(50) 

(51) 


with max'fcPfc the second largest element in {pk)k=i! ONi the set of all distinct POVM 
elements in M, and supp(Ad) := supp(M). 
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Figure 2. (Reproduced from Ref. [120|) Equilibration on average is compatible 
with the time reversal invariant and recurrent nature of the time evolution of 
finite dimensional quantum systems. The figure shows a prototypical example of 
equilibration on average. Started in a non-equilibrium initial condition at time 0 the 
expectation value of some observable A quickly relaxes towards the equilibrium value 
{A)^ and then fluctuates around it, with far excursions from equilibrium being rare. 
After very long times the system returns (close to) its initial state and so does the 
expectation value of the observable. A similar behaviour is observed when the initial 
state is evolved backwards in time. 


Proof. Eq. (H8|) ioi equal to the first argument of the min in Eq. fl50|) is 
Theorem 1 in Ref. |9l| . The same statement, but with g{{pk)k=i) eq ual to the second 
argument in the min, follows from Eqs. (44), (50), (61), and (63) in Ref. jinol ]. With |U17| 


in Eq. (1^ replaced by the total number of all measurement outcomes, i.e., Yhu&M 1^1’ 
Eq. (UnD, for g{{pk)i=i) equa 
Theorems 2 and 3 from Ref. 


to the hrst argument of the min in Eq. ([50]), is implied by 


91]. A careful inspection of Eq. (B.l) in Ref. 9l|, however. 


reveals that the slightly stronger result holds. In particular, one can hrst use the bound 


maxM(t)GxA)M(t)(p(t),a;) < ^ \tr{Ma p{t)) - tr{MaU:)\ (52) 

Ma&yJM 


for the argument of the time average in the right hand side of the hrst line of Eq. (B.l) 
and then use the triangle inequality to pull the time average into the sum. For g{{pk)k=i) 
equal to the second argument the result follows using Eq. fl4^ instead of Theorem 1 
from Ref. in the proofs of Theorems 2 and 3 from Ref. j9ll |. □ 

What is the physical meaning of the theorem? The quantity g{{pk)k=i) is small, 
except if the initial state assigns large populations to few (but more than one) energy 
levels. For initial states with a reasonable energy uncertainty and large enough systems 
it can be expected to be of the order of 0 {l/d'), i.e., reciprocal to the total number 
of distinct energy levels. The quantity h{Ai) on the other hand can be thought of 
as a measure of the experimental capabilities in distinguishing quantum states and can 
reasonably be assumed to be much smaller than d'. In particular, when all measurements 
in A4 have a support contained inside of a small subsystem S' C V it is bounded by 
ds/2. Because of the conditions for equality in Eq. flT^ . the theorem then also implies 

-T 

an upper bound on and hence proves subsystem equilibration on average. 
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For fixed H and e > 0 we have lini'r^oo /(eT) = 1, hence the theorem proves, for 
a wide class of reasonable initial states, equilibration on average of all sufficiently small 
subsystems and apparent equilibration on average of the state of the full system under 
realistic restrictions on the number of different measuremen ts that c an be performed. 
In this sense it improves and generalises the results of Refs. 1061 . Iin7| . 

On what time scales is equilibrium reached? The product N{e) f{eT), which is 
lower bounded by one, will typically be close to one only if T is comparable to 
i.e., to the total number of energy gaps, and will otherwise be roughly of the order of 
r2(d'^/T) for smaller T. So, even under the favorable assumption that g{{pk)k=i) of 
the order of O(I/d'), equilibration of a subsystem S can only be guaranteed after a time 
T that is roughly of the order of r2(d|d'). Both d' and ds typically grow exponentially 
with the size of the composite system and the subsystem S, respectively. Hence, times of 
the order of r2(d| d') are unphysical already for systems of mod erate size. This weakness 
of theorems such as Theorem [1] has been criticised in Ref. 12l| (see Section [3i6] for more 
details on equilibration times). 

There are at least two possible replies to this criticism: First, it is known 
that there are systems in which equilibration does indeed take extremely long (see 
and thus, being a very general statement. Theorem [1] is probably close to 


Section | 

optimal. Proofs of shorter equilibration times will need further assumptions, such as 
locality or transit ion i nvariance of the Hamiltonian, and restrictions on the allowed 
measurements 


91 


107| . Second, almost all systems in which equilibration has been 


studied and in which equilibration of some property on reasonable time scales could 
be demonstrated were found to e xhibit equ ilibration towards the time average (see for 


example Refs. 42|, 1^, l60|, l66|, l69|, Il22l - ll24| ). so in these cases the upper bound on the 


equilibration time implied by Theorem [T] is not tight, but the theorem still captures the 
relevant physics. Transient equilibration to metastable states that precedes equilibration 
to the time average seems to require special structure in the Hamiltonian. That the 
physics of such special systems is not captured by a result as general as Theorem [1] is 
not too surprising. 

An i nteresting variant of the subsystem equilibration setting is investigated in 
Ref. 125| . in which the subsystem S can initially be correlated (either classically or 
even quantum mechanically) with a reference system R. The “knowledge” about the 
initial state of S stored in the reference R can in pr inciple he lp to distinguish the state 
p^{t) from . Still, by using decoupling theorems 126l - ll28| and properties of smooth 


min and max entropies 1291. Il30| it is possible to show subsystem equilibration on 
average under conditions similar to those of Theorem [H in the sense that the combined 
state of S and R is on average almost indistinguishable from 

In the above disquisition on equilibration we have put a focus on the more recent 
literature, however, many of the ideas behind the results mentioned above can already be 
found in the work of vo n Ne umann j^. We encourage the interested reader to consider 
the En glish translation 


in Ref. 


131 of this article and the discussion of von Neumann’s results 


132| and the brief summary of parts of this article in Section [6] of this work. 
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Further statements concerning equilibration towards the de-phase d state, which are 


related to those discussed above, can also be found in Refs, 
these works in more detail in Section 13.61 


133 


136 


We will discuss 


3.3. Equilibration during intervals 

In this section we investigate under which conditions equilibration during intervals can 
be guaranteed. After a brief overview of the literature on the topic we will concentrate 


on the results presented in Ref. |llll |. Instead of reproducing the full proof we will 
only give the intu ition behind it and describe the general structure. One reason for this 
is that Ref. lll| is concerned with a special class of bosonic Hamiltonians, so-called 


quadratic bosonic Hamiltonians, i.e., Hamiltonians that are quadratic polynomials in 
the bosonic creation and annihilation operators. For these Hamiltonians there exists a 
special formalism based on so-called covariance matrices that allows, for example, to 
calculate for a special class of initial states, namely Gaussian states, the time evolution 
of the expectation values of certain observables in a computationally efficient way. A 
full introduction of this formal ism is be yond the scope of this review. More details can 
be found for example in Refs. 1371 - 1140 1. 

Equilibration during intervals of n qn-G aussian initial states under certain quadratic 
Hamiltonians has be en pr oven in Ref. 112| and the results have later been generalised 


and improved in Ref. |lll| . The techniques are inspired by earlier works |141j on classical 


harmonic crystals, i.e., systems of coupled classical harmonic oscillators, and can be seen 
as bounds on the pre-asymptotic b ehav iour and an ext ension to hnite system sizes of 


the results on equilibration of Ref. 


142 


See also Refs. 


142 


144 for related results on 


equilibration starting from Gaussian initial states. 

More precisely, the results on equilibration during intervals of Ref. 
systems evolving under certain quadratic Hamiltonians of the form 


111 


concern 


1 

2 




K b -\-b K b^) 


(53) 




where bx,bl, are the bosonic annihilation/creation operators on site x G V and K G 


]^|v|x|v|^ The operator FT, as dehned in Eq. (155]) , is unbounded and henc e, in 


prin ciple, a 


careful treatment of the system with the methods of functional analysis |145L Il46| would 
be necessary. The Hamiltonian in Eq. (I53|) is, however, particle number preserving. 


Thus, when we restrict to initial states with hnite particle number the whole evolution 
happens in a hnite dimensional subspace of the Fock space. The Hamiltonian H and all 
relevant observables can then be represented by bounded operators on this subspace. We 
are hence back in the framework of hnite dimensional quantum mechanics as introduced 
in Section [2] and the following statement is well-dehned; 

Theorem 2 (Equilibration during intervals). Consider the class of systems with a 
finite number of bosons in M modes on a ring with nearest neighbour interactions, 
i.e., V = [M] and 8 = {(1, 2), (2, 3),..., (M, 1)}, evolving under a Hamiltonian of the 
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form given in (15S|) with K^ y = modM,i- Let Ti he the direct sum of Fock layers 

up to the maximal particle number. If the initi al sta te p{0) E S{'H) satisfies a form of 
decay of correlations (Assumptions 1-3 in Ref. fllA] ) and has time independent second 
moments (see Ref. fllA] ). then for every S' C V and every e > 0 there exists a system 
size M*, such that for all M > M* there exists a time treiax independent of M and a 
time tree £ such that there exists a Gaussian state u E S{TL) such that 


Vt e [trelax, ^rec] : LI (p^(t),CV^) < 


e. 


(54) 


Proof. The theorem is essentially implied by Theorem 2 and Corollary 1 from Ref. lll| . 
as well as the discussion between th em. The scaling of the times Relax and free follows 


from Eq. (61) and Lemma 4 in Ref. 


Ill 


□ 


The theorem proves equilibration during the interval [treiax, Rec] of all small 
subsystems of a sufficiently large system. It is key to this type of equilibration that 
the state a) is a Gaussian state, even if the system was initially prepared in a non- 
Gaussian state. In fact, a similar convergence to a Gaussian state can be proven even in 
instances where the second moments are not constant in time. Then it is still true that 
non-Gaussian states become locally Gaussian over time, but local expectation values will 
then not become stationary. Again, it is important to note that the class of Hamiltonians 
considered here is special — the Hamiltonians are quadratic in the bosonic operators — 
but this does not apply to the initial states. The technical requirements on the initial 
state allow, for example, for ground states o f ga pped interacting local Hamiltonians. 


These conditions, precisely laid out in Ref. 111| . ask for an algebraic decay of two- 


and four-point-functions, as well as an algebraic decay of correlations between Weyl 
operators belonging to distant regions. 

The time freiax depends on the size of the subsystem S under consideration, but is 
independent of the size of the composite system. It depends on the speed at which the 
Hamiltonian is able to transport correlations through the system and the length scale 
on which the correlations in the initial state decay. The time free is a lower bound on 
the recurrence time and is slightly smaller than the time it takes for a signal to travel 
around the ring of bosonic modes. 


3.4- Other notions of equilibration 

In this section we briefly cover two other notions of equilibration for closed quantum 
systems. Th e hrst alternative notion of equili brati on we want to discuss was proposed 


in Ref. jl32| and further investigated in Ref. 

article of von Neumann Q. There, von Neumann postulates that on large systems only 


147| . This work is closely related to an 


a set of so-called macroscopic observables is accessible. The macroscopic observables 
are required to commute, thus they divide the Hilbert space in subspaces, so-called 
phase cells^ each containing states that belong to the same sequence of eigenvalues for 
all the macroscopic observables (see also the more detailed d iscus sion of Ref. Q in 
Section [6]). If one of the phase cells is particularly large. Ref. jl32l| associates it with 
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thermal equilibrium and says that a system is in thermal equilibrium if and only if its 
state is almost entirely contained in that cell. Variants of the results from Ref. can 
then be used to prove equilibration in this sense. 

Reminding oneself that measurements of quantum systems are ultimately sampling 
experiments opens up an entirely new vista on the problem of equilibration, which leads 
us the second alternative notion of equilibration. Performing a measurement of an 
observable does neither provide the experimentalist with the measurement statistic nor 
does it yield the expectation value of the observable. Both can only be approximately 
determined by repeatedly performing the same experiment many times. How many 
repetitions are needed to distinguish whether the measurement statistic of a given 



of very hne grained observables pure quantum states are practically indistinguishable 
from states corresponding to statistical ensembles. 


3.5. Lieb-Robinson bounds 


An important tool f or the st udy of equilibration phenomena is provided by Lieb- 


Robinson bounds 


72 


Tbilisi 


They limit the speed at which excitations can travel 


through a quantum lattice system equipped with a locally interacting Hamiltonian. 
They can be viewed as an upper bound on group velocity of any excitation. In systems 
satisfying a Lieb-Robinson bound, information propagation is essentially contained 
within a causal cone, reminiscent of a “light cone” or “sound cone” (see Figure |3]). 
Any excitations spreading faster than a maximum velocity are exponentially suppressed 
in the distance. Such bounds make rigorous the expectation that no instantaneous 
information propagation should be possible in quantum lattice models, and thereby 
immediately provide lower bounds to equilibration times for such models. The 
implications of Lieb-Robinson bounds to entanglement dynamics will be discussed in 
Section 14.31 

Concretely, Lieb-Robinson bounds are statements of the following type: 

Theorem 3 (Lieb-Robinson bound (corollary of Theorem 1 from j3|))- Consider 
a locally interacting fermionic or spin system with Hilbert space Ti and Hamiltonian 
H G 0{TL). Let 0{Ti) be observables and denote B(t) := B . Then 


\\[ABmL<C\\A\\^\\B\l 


|t|-d(A,n) 


(65) 


where the Lieb-Robinson speed v depends only on the operator norm of the local terms 
of H and the coordination number maxxef|{V G T: X fl V 7^ 0}| of the interaction 
graph, and C is a constant that depends only on min(| supp(A)|, | supp(R)|). 
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The theorem says that the commutator [A,B{t)] is exponentially suppressed with 
the distance between the support of A and a “light-cone” that grows with the time the 
observable B is evolved under the Hamiltonian H. As A could be one of the local 
terms of H this in particular implies that the distant terms of the Hamiltonian do not 
signihcantly influence the time evolution of B and that for any time t the operator B{t) 
can be approximated by an observable with support only slightly larger than the base 
of the “light-cone” at that time. 

Such “hght-cone”-like dynamics has been systematically explored and put into the 
context of equilibration analytically and numerically 77H79I. If 111 Ill2[ Il56l - ll60| as well 
as experimentally 2l|, l27|, l28|, l30|, Si niilar bou nds also exist for more general 


settings, like local Liouvillian dynamics 7^, ll62L Il63| . exponentially decaying but no 


longer strictly local interactions 72|. a s well as for certain long-ranged, i.e., power law 
like decaying, interactions 72l . ll59l . ll64l | as long as the exponent is sufficiently large. Such 
long-ran ged interactions have been experimentally investigated in systems of trapped 


ions 


21 


161 


Lieb-Robinson bounds can also be proven f or certain systems with Hamiltonians 
with local terms with unbounded operator norm 165l - ll67| . For example, for quadratic 
bosonic systems with Hamiltonians of the form 

^ = ^y + ^-^y ^ ^ (^ 6 ) 


x,y€V 


where bx, are again the bosonic annihilation/creation operators on site x G V a Lieb- 
Robinson bound holds. Writing 


K = 


Qx,y T Pc 


x,y 


^x,y 


Qx,y Px,y 


(67) 


2 2 ' 
such Hamiltonians can be cast into a form reflecting couplings between canonical 
positions ■= + b\.)/\/2 and momenta Px '■= ~ bx)/\/2 

H=^^{qxQx,yqy+PxPx,yPy), Q, P G (58) 

x,yGV 

In this setting, local means that Kx^y = Lx^y = Qx,y = Px,y = 0 for d(x, y) > R for some 
P G N. We write dx,y ■= d{x,y)/R and dehne r := max{||P||QP||oo^^^} |t|, 
then the following Lieb-Robinson bound is valid; 


Theorem 4 (Lieb-Robinson bounds for qnadratic bosonic systems 167j L Consider a 
Hamiltonian of the form given in Eq. (I58|) then 


and 




\\[qxit),Py]\\, 

\\[Px{t),qy]\\^ 


< 


^d^,y+‘i cosh (r) 


(59) 


>^x,y 


< 


T<^c,,y (.Qgp 


d. 


( 60 ) 


x,y 
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(a) (b) 

Figure 3. Schematic depiction of the Lieb-Robinson “light” cones in clean systems (a) 
and the more stringent bounds that can be derived in disordered systems (b). Outside 
the shaded area causal influences are exponentially suppressed. 


That is, for sufficiently large d{x,y), one finds a faster-than-exponential decay of 
commutators between the canonical position and momentum operators. This gives rise 
to a “light cone” with the Lieb-Robinson velocity 


t. = eflmaxdlQFII^y IIPQII^’}. 


(61) 


Despite the results of Refs. 165l - ll67| . a full proof of a Lieb-Robinson bounds for a 
natural, interacting, inhnite dimensional model, such as the Bose-Hubbard model with 
hnite hlling, is to date still missing. 

A problem that has recently started to attract an increasing amount of attention 
is transport in disordered systems. For an XY spin chain with disordered interactions 
and disordered external magnetic held a Lieb-Robinson type bound of the form 


\\[ABm^<Cn^\t\ 


\B\ 


,-r) d{A,B) 


(62) 


with n th e length of the chain and C,t] > 0 constants was derived in Ref. 168| (see also 


Ref. 169| for a similar result). Notice that the bound is signihcantly more stringent 
than the ones we discussed before. In order for the right hand side to be signihcantly 
larger than zero, a time t that scales exponentially with the distance d(y4, B) is necessary 
(see Figure [3]). Hence, in the considered disordered system information propagation is 
constrained to a region that is not even a cone — its radius only grows logarithmic 
with time. There is even evidence that for a given realisation of a disordered XY chain, 
one obtai ns a certain type of zero velocity Lieb-Robinson bound with high probability 


170l. Il71| . We will return to the problem of transport and (many-body) localisation 


m 


disordered systems in Section 


3.6. Time scales for equilibration on average 

In this section we summarise what is known about the time scales on which subsystem 
equilibration to the reduction of the de-phased state happens, i.e., on which time 
scales small subsystems equilibrate towards their time averaged state. We will see 
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that it is possible to go beyond what Theorem [T] implies, but that all analytical results 
known to date that do so have the disadvantage of not being applicable to concrete 
Hamiltonians, but are only statements about all but few Hamiltonians from certain 
probability measures. 

To discuss such results we will need to refer to and use methods of typicality 
and measure concentration. In particular, we will encounter the uniform or Haar 
measure /iHaar[f^(c?)] on fhe unitary group U{d) of dimension d and the probability 
P(7^^Haar[;7(rf)](^) ^ givou statomout A holds for unitary operators U ~ /iHaar[f^(d)] 

drawn from the Haar measure. Readers unfamiliar with these constructions might want 
to refer to Section [6l where we discuss them in more detail. 

We argued in the paragraphs following Theorem [1] that the bounds in Eq. fHHj) and 
Eq. (l49|) can be expected to become meaningful only if T is of the order of d'). As 
d' usually grows exponentially with the system size the equilibration times implied by 
Theorem [1] become physically meaningless already for medium sized systems. 

There are good reasons to believe that without further assumptions on the 
Hamiltonian no significantly better general bounds on the subsystem equilibration 
time can hold. An example of a system that indeed can take exponentially long 
to equilibrate is a bipartite system in which the subsystem is only coupled to a low 
dimensional subspace of the Hilbert space of the bath. It can then take exponentially 
long before the Hamiltonian on the bath has rotated the state of the bath into this 
subspace, there by effectively leaving the subsystem uncoupled for extremely long times 
for a related construction). Such a coupling to a l ow dimensional 


172 


(see Ref. 

subspace is necessarily non-local and hence unphysical. In Ref. 173|, however, it is 


demonstrated that density inhomogeneities can persist also for exponentially long times 
even in translation invariant interacting lattice models. Very slow dynamics is also 
characteristic for disordered and glassy systems. 

Numerical evidence suggests that most natural, locally interacting systems without 
disorder started in reasonable initial states do not exhibit such extremely long 


equ ilibration times, see for example Refs. 0, 0 00, 0, 0-0 


69 


122 


123, 174 


176|, even t houg h surprisingly slowly relaxing local observables can be c onstr ucted in 
177 1 and also power law approaches to equilibrium can occur jl78| . 


some cases 


As it is still unclear how the features of natural many-body models, such as locality 
of int eractions, can be exploited to derive tighter bounds on equilib ratio n time scales. 
Refs. |T0 - fT0. [l0 | instead consider random Hamiltonians and Ref. jl72| certain types 
of random observables, as well as a class of non-r andom low ra nk observables. We 
first cove r the results of the type derived in Refs. 133 - 136 . [I0 | but concentrate on 
Ref. 


134| . as it goes beyond the rather unrealistic scenario of Hamiltonians with Haar 


random eigenstates. 

As a warm-up, we shall, however, consider exactly the situation of Hamiltonians 
with Haar random eigenvectors. First, we define what a Haar random Hamiltonian is: 
Consider a system with Hilbert space TL of dimension d and fix an observable G G 0{'H). 
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Hg{U) :=UGU^ 


(63) 


is a Haar random Hamiltonian. Of course, G and Hg{U) share the same spectrum 
and eigenvalue multiplicities for any unitary U, but the energy eigenstates / spectral 
projectors of Hg{U) are Haar random. Fixing G is thus equivalent to fixing the 
eigenvalues and degeneracies of the ensemble Hg{U), U ~ RHiiiir[U{d)] of Haar random 
Hamiltonians. 

A quantity that will play an important role in the theorems to come is 


fcit) ■- 


1 

d 



k=l 


(64) 


where is the sequence of eigenvalues with respective multiplicity of G (and hence 

also of Hg{U) for any unitary U). The function fc can be interpreted as the Eourier 


transform of the sequence {Ek)k=i 134 


We can now state the first result of Ref. jl^ . which concerns quantum systems 
composed of spin-1/2 systems, so-called qubits, i.e., quantum systems whose Hilbert 
space is C^: 


Theorem 5 (Equilibration under Haar random Hamiltonians 134 Result 1]). Consider 
a bipartite system consisting of |V| many qubits, i.e., V = S il B and Ti = 
with TL^x} = for all x &V, starting in a fixed initial state p(0) G S{'H). Then, for 
every G G 0{'H), every f G M, and every e > 0 if holds that 


II I ^ > 

[U (ti)] 


\/^ 

2e 




d'^ ds) 


9g 


< e, 


(65) 


where u:hg{u) ■= TrB($HG(t/)(p(0))) and go := maxk(.[d]\{l ■ Ei = Ek}\ with {EQf^^ the 
sequence of eigenvalues with respective multiplicity of G. 

A very similar result is also contained in Ref. |l^ . Essentially, Theorem [5] connects 
the temporal evolution of the trace distance of p^{t) from the equilibrium state uJq with 
the temporal evolution of IfciQl- If the bath is large and the Hamiltonian has only few 
degeneracies, then for most Haar random Hamiltonians the distance is 

small whenever Ifcit) \ is small. This will make it possible to give bounds on equilibration 
time scales. 

The above result can be extended to a more general ensemble of random 
Hamiltonians. More specifically, consider again the setting of a composite system of N 
qubits and the ensemble Hg{U), but now with G G 0{'H) diagonal in some product basis 
and U given by a random circuit of circuit depth G G Z+. Here, a circuit is a sequence of 
so-called quantum gates, i.e., unitary quantum channels that each act on only one or two 
qubits. The gates can be members of a so called universal gate set, i.e., a set of quantum 
gates such that any unitary can be approximated arbitrarily well by a circuit of gates 
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from this set. The circuit depth of a circuit is the number of gates in the circuit. Finally, 
a random circuit is a circuit in which the gates have been drawn randomly according 
to some measure from a universal gate set. We write fic for the measure on unitaries 
induced by random circuits of circuit depth C with gates drawn uniformly at random 
from some fixed, hnite universal gate set. It is known that limc^oo Rc = R}ia.ar[U{d)] and 
that for large enough C the me asur e {ic approximates fiHaariU (d)] in the sense of being 

This holds regardless of which hnite universal gate 


180 


an approximate unitary design 
set is used. 

For the random circuit ensemble of random Hamiltonians the following statement 
holds, which generalises Theorem |5l 

Theorem 6 (Equilibration under random circuit Hamiltonians 


134 Result 3]). 


Consider a bipartite system consisting of N := |V| many qubits, i.e., V = S U B and 
TL = with Ti{x} = for all x E V, starting in a fixed initial state p(0). 

There exists a constant a G M that depends only on the universal gate set such that for 
every G G 0{TL) diagonal in a product basis, every t G M, every circuit depth C G Z"*", 
and every e > 0 


P 


'v (pyi).<TO) > ^ (l/G(i)l‘ + f + ^ + 


1 / 2 '' 


< e, (66) 


where ■= TrB($HG((7)(p(0))) and go := maxfce[d]|{^: Ei = Ek}\ with {EQl^^ the 

sequence of eigenvalues with respective multiplicity of G. 

As can be seen from Eq. (|66l) . a slightly super-linear circuit complexity, i.e., 
G = G{N) ^ 0(iV), is sufficient to make the additional term in Eq. (I66l) (compared to 
Eq. (|65ll ) go to zero for large N. 

If this is the case, and in addition N is large enough, the bath is much larger than the 
subsystem, i.e., S> ds, and G has only few degeneracies, i.e., go -C d, then the right 
hand side of both Eq. fl65l) and fl66l) is approximately equal to |/g(^) P \/ds/(2 e). Hence, 
the bounds are non-trivial for reasonably small e for all t for which y/d^ |/(t)p -C 1. For 
which times t this is the case of course crucially depends on the spectrum that was hxed 
by hxing G. 

The spectrum of the Ising model with transverse held, for example, leads to an 
approximately Gaussia n de cay of |/(t)p, implying an estimated equilibration time of 
the order of jl34 |. For more general locally interacting Hamiltonians on D- 

dime nsional lattices one can show equilibration times of the order of 


135 


This means that given an initial state p(0), if G is chosen to be the Hamiltonian 
of the transverse held Ising model and U ~ RRaa.r[U{d)], then the dynamics under 
the Haar random Hamiltonian Hg{U), which has the same spectrum as G, is, with 
high probability, such that the time evolution p: M —>■ S{'H) is such that the state 
of any small subsystem S equilibrates to the reduced state of the de-phased state 
on that subsystem, during a time of the order of 0(A^“^/^). This, however, is in 














3 EQUILIBRATION 


32 


contradiction with the intuition that larger systems should take longer to equilibrate, 
simply because excitations in locally interacting spin systems travel with a finite speed 
(see also Section 1375]) . One would expect that for locally interacting systems of N spins 
on a D dimensional regular lattice with nearest neighbour or short range interactions, 
subsystem equilibration should happen on a time scale of the order of where 

is the linear size of the system^ for many reasonable initial states. 

Another point of criticism is that one can show that for Ha ar ra ndom Hamiltonians 
the subsystem equilibrium state is the maximally mixed state 1351 . Corollary 1] and a 
similar statement can be shown for the random circuit ensemble of random Hamiltonians. 
Systems to which the above results apply can thus never exhibit subsystem equilibration 
to an interesting, e.g., finite temperature, state. 

The reason for both of these problems is that neither the model of Haar random 
Hamiltonians nor that of Hamiltonians whose diagonalizing unitary is given by a random 
circuit with high circuit complexity are good models for realistic, locally interacting 
quantum systems. Simply put, even though random Hamiltonian ensembles have been 
successfully used to model certain f eatures of realistic Hamiltonians in the context of 
random matrix theory 69, 181 - 188| . the eigenstates of reasonable locally interacting 
quantum systems are far from Haar random. 


We hence turn to the results of Ref. 172l | for concrete Hamiltonians and 
measurements. The main result of that work is a bound on the equilibration time 
of low rank measurements, i.e., POVMs with two outcomes, one of which is a low rank 
projector. Such measurements do not correspond to local observables, but rigorous 
bounds on their equilibration behaviour can be given even for concrete situations; 

Theorem 7 (Fast equilibration of low rank observables 172j). Given a system with 
Hilbert space Ti and non-degenerate Hamiltonian H G 0{'H) with spectral decomposition 
H = X]fc=i \Ek){Ek\. For pifS) G S{Ti) the initial state of the system, letu = $j^(p(0)) 
he the de-phased state and define the energy level occupations pk '■= {Ek\p{0)\Ek) ■ Let 
A4 = {(H, 1 —H)} with H a rank K projector, then 

Vm (p(t),c^f < C- (77(1/T) (67) 

with (7 = 5 7r/(4 ■>/! — 1/e) + 1 and for any A > 0 

Pk- ( 68 ) 

' fc: E^,&[E,E+/l] 

One can now show that rj is lower bounded by l/d®®(ci;) but also argue that, up to 
reasonably large T, it holds that p(l/T) < C/T with C a constant that depends on the 
shape of the energy distribution of the initial state. In particular, the above theorem 
predicts an at least power-law like approach to equilibrium of low rank measurements 
on a time scale prop ortio nal to K/ge with ag the energy uncertainty in the initial state 
(for details see Ref. 172 |). 

In addition to the above results some lower bounds on equilibration time scales 
exist: For example, if a state has overlap only with energy eigenstates of the 


p(A) := sup 

EeM, 
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Hamiltonians in an energ y int erval of width A-E, then the equilibration time is at least 


of the order of 118| (see also Ref. 189|). Similarly, if the Hamiltonian if of a 


bipartite system with V = S' U R is uncoupled, except for a small coupling Hamiltonian 
Hr := H — Hs — Hb, then the equilibration time is at least of the order of f^(l/ H-Sf/Hoo) 


118l . Section 2.6.3]. Similarly, lower bounds on the equilibration and thermalisation 

- follow from bounds on the rate of change of 
lower bounds on the equilibration time of the 


time — as will be discussed in Section 
certain entropies 


125 


190 


In Ref. 


mJTl- 

(ml. 


type have been obtained for a class of spin systems with long range interactions. 

For spin systems with short range interactions, Lieb-Robinson bounds (see Section 13.5p 
immediately imply lower bounds on the equilibration time for certain initial states that 
are of the order of the linear size of the system. Finally, in systems whose density of 
state s can be approximated by a continuous function the Riemann-Lebesgue Lemma 
192 ] can be used to give upper bounds on equilibration time scales jd^ . Despite the 


large number of results the full problem still awaits a solution. 


3.7. Fidelity decay 

A scenario in which the equilibration behaviour has been studied in detail and is 
now particularly well understood is that of fidelity decay. Rather than looking at the 
expectation value of say a local observable the quantity, whose equilibration is of interest 
here is the hdelity between the initial state and the time evolved state at time t. For pure 
initial states '0(0) = |0)(0| G ^{'H) and unitary time evolution under a Hamiltonian 
H G 0{'H) the hdelity takes the simple form 






(69) 


which makes apparent that it can be seen as the square of the Fourier transform of the 
weighted energy distribution of the initial state. 

As 0(0) can be seen as a low rank (in fact rank one) observable, the result on the 
equilibration times of low rank observables (Theorem [7]) can be used to bound the time 
scales on which hdelity decays. Below this, typically power-law bound a rich variety of 


diherent decay behaviours can be observed |193 


195 


4. Investigations of equilibration for specific models 

There is a large body of literature studying equilibration dynamics of quantum many- 
body systems partly with analytical, but mostly with numerical methods. These works 
typically focus on a specihc model or a subclass of models. In this section we cover a 
selection of works in this direction. We will discuss many more works with a similar 
scope later in Section 17.61 once we have introduced the concept of thermalisation. 
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Often the behaviour of quantum systems after a suddenly altered Hamiltonian, a so 
called quench is considered. In this much discussed setting, the initial state p(0) is, 
e.g., the ground state of a locally interacting Hamiltonian and following the sudden 
quench to a different locally interacting Hamiltonian if, properties of 


p(f)=e-‘‘^p(0)e 


AtH 


(70) 


are explored. The seminal early study 


196 


introduces quenches to the literature 


and finds a “non-approach to equili brium ” in the XY model that is mapped to a 
quadratic fermionic system. Refs. ^ Il56| use field theoretical methods to gain insight 
into the dynamics of correlation functions after quenches. If the final Hamiltonian 
is close to being critical, notions of universality are being identified at long times. 
The early work ^ investigates an out of equilibrium phase diagram of the Bose- 
Hubbard model, arising from quenches from the superfluid to Mott phase. Ref. 
also considers out of equilibrium dynamics in the Bose-Hubbard model and discusses 
signatures of equilibration that can be pro bed using optical super-lattices. A similar 
setting is numerically analysed in Ref. 26|, which is then taken as a be nchmark for 


an experiment performing a dynamical quantum simulation. Ref. 197l | numerically 


investigates quenches inside the Mott phase with a method most suitable for lattices 
with high coordination number. Ref. 198| studies the relaxation dynamics in XXZ 
chains following a quench. Here, a rich phenomenology emerges and both oscillatory 
and exponential relaxation are being observed. Counter-intuitively, the relaxation speed 
increases at a critical point for the anisotropy parameter. The seminal experimental work 
also studies the on-equilibrium evolution of coherent states, being superpositions 
of different particle number states in a three dimensional optical lattice, observing 
collapses and revivals. A very powerful tool in numerical studies is provided by time- 
dependent variants of the density-matrix renorma lisation group (DMRG) approach and 


related tensor network approaches 32|, Il99l - l2ni| . An early example being Ref. 


77 


which studies the spreadin g of entanglement after quenches in Heisenberg spin chains. 
Noteworthy are Refs. 58|, |202| . which investigate equilibration with such methods in 
a setting describe d by the Bose-Hubbard model which can be realised with ultra cold 


atoms |26j. Ref. 203| uses time dependent DMRG methods to study the relaxation 
dynamics after quenches in the Tomonaga-Luttinger m odel and in systems of spin-less 
fermions and finds universal long time behaviour. Ref. 204| discusses the equilibration 
in the Bose-Hubbard and Fermi-Hubbard models following a global quench, employing 
an expansion in large coordination numbers. Ref. uses quantum Monte Carlo 
techniques to investigates the equilibration dynamics af ter s witching off the coupling of 
the Hubbard-model starting form a thermal state. Ref. j2n5| focuses on quenches in ID 
spin-less fermions. Ballistic transport is observed, except if the quench is from a metallic 
state deep into the insulating phase, in which case lo cal doma ins form reminiscent of 
the picture provided by the Kibble-Zurek mechanism 


206 


207 
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non-equilibrium dynamics emerging from bringing tw o sys tems together in a language of 
conformal and relativistic quantum field theory. Ref. 209| presents a detailed numerical 
study of the time evolution under various, integrable and non-integrable, translation 
invariant spin Hamiltonians for several types of initial states, including domain wall 
states with all spin in the left half up and all in the right half down. This situation 
can be thought of as a local quench. A similar setting is analysed in Ref. 22 1| for XXZ 
chains, where equilibration is also found, albeit to a state that retains memory on its 
initial state. Local quenches and the subsequent (quasi-particle) dynamics can now also 
be probed experimentally with impressive precision 2l| . 


4-3. Entanglement dynamics 

Early on, it has been realised that the light-cone like propagation of excitations following 
global quenches is accompanied by a growth of enta nglement if the initial state has low 
entanglement or even is a product state 7^, l76l-l78[ 11571. l222| . Entanglement is always 
dehned with respect to a separation of the system into distinct, spatially separate 
subsystems. Bipartite entanglement of pure states with respect to a decomposition 
V = X U for some subsystem X G V can be measured in terms of the entanglement 
entropy dehned for any state p as 


Ex{p) = S{p 




(71) 


where S denotes the von Neumann entropy. In a precise sense, this is the “unique 




measure of entanglement” in this pure bipartite setting (97| and we call a pure state p 
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uncorrelated (with respect to a decomposition V = X UX'^) if Ex{p) = 0. Other Renyi 
entropies 

(72) 

for p > 0, however, also play a role when it c omes to questions of approximations of 
states with tensor network methods 


223 . 224| . Lieb-Robinson bounds imply an affine 


upper bound for the entanglement entropy following global quenches: 

Observation 1 (Entanglement growth). For any locally interacting system with 
Hamiltonian H and any X <OV it holds that 


£x(pW)-BxWO))eO(i). 


(73) 


Conversely, there exist pairs of (translation invariant) locally interacting Hamiltonians 
H and pure uncorrelated initial states p(0) such that 


Ex{p{t)) ^ Ot{t). 


(74) 


Moreover, if p(0) is uncorrelated, then for t fixed and any family of subsystems X C V 
of increasing size the entanglement entropy scales only like the boundary of these subsets 
in the sense that 

Ex(p(t)) 6 OdA'al), (76) 

where Xq is the set of elements of the edge set £ of the Hamiltonian that overlap with 
both X and X^. 


Ehe first and last statements have been proven in R e fs. 74 . l78| and improved in 


801. The second statement follows from Refs. 


222 


2251. The intuition behind 


Ref. 

these statements is clear: Following the ballistic propagation of quasi-particles, at most 
a linear growth of entanglement over any finite cut can be observed. This indeed follows 
from suitable Lieb-Robinson bounds. Similarly, such a bound can be saturated for 
quadratic models, so it is tight in this sense. At the same tim e, for each fixed time f > 0, 
the entanglement entropy follows what is called an area law 226| for the entanglement 
entropy, in that the entanglement scales at most like the boundary area |Xg| of the 
subset X. 

There is a large body of literature ^at corroborates the intuition behind this 
theorem 


21 


76H80|. The early Ref. [77|, for example, st udies the spreading of 


entanglement aft er q uenches in Heisenberg spin chains. Ref. 222| studies quadratic 
models. Ref. 76, EH specifically develops the quasi-particle picture. 

The above notion of entanglement is not the only one that can and has been 
meaningfully considered. The correlations present in states arising from out of 
equilibrium dynamics can also be captured in terms of bipartite entanglement of 
separated subsystems which jointly still form a small subset of the entire lattice. Since 
the state under consideration is then no longer pure, other measures of entanglem ent 


971. |227| have to be employed, such as the entanglement of formation 228 


229 


or 
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the negativity 230l - l232| . Such entanglement measures have defining features of being 
monotone under local operations and classical communication (LOCC) and vanish on 
separable states as discussed in Eq. (j371) ). 


While it is known from the monogamy of entanglement 233| that at any time most 
sites of a lattice are not entangled, in the c ourse of e ntanglement dynamics, remote sites 
generically get entangled at suitable times 234j - l236| . The intuition is that “wave fronts” 
of entanglement propagate ballistically through the lattice. Such bipartite entanglement 
has already been experimentally observed in systems of trapped ions 


211 . 


Ramps, slow guenches and the dynamics of guantum phase transitions 

Many works discuss also non-instantaneous ramps and other instances of so-called 
slow guenches. In this context, the dynamics under a family of locally interacting 
Hamiltonians 

H{t) = Ho + f{t)V (76) 

is usually studied, with /: [0, oo[—)■ M being a suitably slowly varying function and p(0) 
the ground state of 77(0) = Hq. Such a situation is specifically interesting when at an 
instance in time to the Hamiltonian II{tQ) undergoes a second order quantum phase 
transition. If the change of the Hamiltonian in time is sufficiently slow, far away from 
the phase transition the adiabatic theorem will be applicable and the state p{t) is then 
well approximated by the instantaneous ground state of the Hamiltonian at that given 
time t. However, in the vicinity of the critical point, the Hamiltonian gap will close 
down, and no slow change of the Hamiltonian will be sufficiently slow such that the 
adiabatic theorem could still capture the situation at hand. This setting hence allows 
to explore the dynamics of guantum phase transitions. This review cannot give justice 
to this topic, which can be considered a research field in its own right. We still attempt 
to give a short sketch of important ideas. 

The Kibble-Zurek mechanism provides an intuitive _ u nder standing of the 


phenomenology of such slow quenches across critical points |2n6L 1207|. It is specifically 


well understood for t hermal p hase transitions, a setting in which it has also been 
experimentally tested j237 - 239 |. For quantum phase transitions similar scaling laws can 
be derived in the limit of infinitely slow ramps, invoking adiabati c perturb ation t heory 


and universality arguments but the situation is more involved 240l - l243| . Ref. 


244 


considers an exponential ramp from the superfiuid phase into the insulating one and 
calculates the time dependence of various experimentally relevant quantities for this 
case, and Ref. 


Ref. 


245| treats further analytically solvable ramps. 


246| experimentally probes the Mott-insulator to superfiuid transition in the 


Bose-Hubbard model by slowly decreasing the ratio of the interaction energy to the 


hopping strength. Ref. 4l| studies the Mott insulator to superfiuid quantum phase 


transition experimentally with ultra cold atoms and compares the findings to extensive 
numerics for the Bose-Hubbard model, using exact diagonalis ation and tensor network 
techniques. Also in the Bose-Hubbard model Refs. 2471. |248| study the formation and 
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249 


melting of Mott-insulating domains during ramps with tDMRG methods. Ref. 
analytically investigates hnite time ramps of th e inter-m ode interaction strength in a 
Luttinger liquid model. The series of works Refs. 25nl - l252| investigates the formation of 
topological defects after quenches that involve the breaking of a continuous rotational 
symm etry. For reviews on this field — to the extent it is understood to date — see 
Refs. 


206 


240 


253 


5. Quantum maximum entropy principles 

In this section we connect the pure state statistical mechanics framework with the 
canonical approach to justify the ensembles of statistical physics by means of a maximum 
entropy principle. We first show that the apparent equilibrium state in systems that 
equilibrate on average can always be dehned in terms of an entropy maximisation 
under the constraint that the expectation values of all conserved quantities are held 
fix. Then we will discuss the possibly surprising fact that in many cases, in particular 
following quenches of sufficiently complex quantum systems, equilibrium expectation 
values of many relevant observables are very well approximated by those in a state that 
is the entropy maximiser given a much smaller set of constants of motion — a so-called 
generalised Gibbs ensemble (GGE). 


5.1. A maximum entropy principle based on all constants of motion 


We have seen in Section 13.21 that if the expectation value of an observable or the 
reduced state of a subsystem equilibrates on average, then they necessarily equilibrate 
to their expectation value in, or the reduced state of, the time averaged/de-phased state 
oj = J> = $h(p( 0)). The state u hence encodes the information necessary to describe 
the equilibrium properties of such a system. It turns out that it is also the maximum 
entropy state given all constants of motion; 


Theorem 8 (Maximum entropy principle 254l |). Consider the time evolution p: M —)• 
S{TL) of a quantum system with Hilbert space TL and Hamiltonian H G 0{'H). If the 
expectation value of an operator A G B{'H) equilibrates on average, then it equilibrates 
towards its time average, given by 


Tr(A p) = Tr(y4ci;), 


(77) 


where oj = p is the unique quantum state that maximises the von Neumann entropy S, 
given all conserved quantities. 

Proof. That the equilibrium value of the expectation value of A is given by Tr(y4a;) 
follows directly from the dehnition of equilibration on average. The time averaged state 
OJ is equal to the de-phased initial state 

d' 

$h(p( 0)) = Ilfc p(0) Ilfc. 
k=l 


( 78 ) 
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Figure 4. (Reproduction from Ref. [120j) Dephasing implies a maximum entropy 
principle. A quantum system started in an initial state p(0) represented in panel (a) 
in an eigenbasis of its Hamiltonian H with degenerate subspaces corresponding to the 
squares, evolves (b) in a way such that time averaging its evolution (c) has the same 
effect as de-phasing the initial state with respect to H. The time averaged state p 
is the state that maximises the von Neumann entropy under the constraint that all 
conserved quantities give the same expectation value as in the initial state p(0). 


The de-phasing map $h is a so-called pinching and the von Neumann entropy is non¬ 
decreasing under pinchings 88|, Problem II.5.5] (this is a generalisation of Schur’s 
theorem). Furthermore, two states o'i,(T 2 G S{'H) yield the same expectation values 
for all conserved quantities, i.e., all A G 0{'H) that commute with the Hamiltonian 
[A,H] = 0, if and only if $_h'(cti) = $_h'(ct 2 ). This already shows that uj has the 
maximal achievable von Neumann entropy given all conserved quantities (see also 
hgure Uj). It remains to show uniqueness. Let B be a basis of the linear span of all 
A G 0{'H) with [A, H] = 0. The objective function of the maximisation problem, 
namely the von Neumann entropy, is a strictly concave function S: S{'H) —?• M and 
it is optimised over all a G S{'H) under the hnite number of affine equality constrains 
\/B G B: Yi{B a) = Tr(i?p(0)). Under the se conditions uniqueness follows from a 
standard result from convex optimisation j255| . □ 

Theorem [H] is very reminiscent of Jaynes’ maximum entropy principle. It is in 
any case remarkable that it is not, as in Jaynes’ approach, a postulate motivated by a 
subjective interpretation of probability that is taken as a starting point of a statistical 
theory, but a consequence of purely quantum mechanical considerations. The unitary 
quantum dynamics of closed systems alone gives rise to a maximum entropy principle. 

Theorem [H] is at the same time not the end of the story. It says that the equilibrium 
expectation values of all observables that equilibrate on average can be calculated from 
the state that ma ximis es the von Neumann entropy given all conserved quantities 
(compare also Ref. 256| ). The number of all linearly independent conserved quantities 
of a composite quantum system, however, increases exponentially with the number of 
constituents, and hnding each of them individually usually again requires resources that 
scale exponentially with the system size. The predictive power of Theorem |8] is hence 
rather limited. 
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In light of the insight discussed in the last section an interesting question to ask is 174| : 
“How many and which conserved quantities are actually relevant? Can one reasonably 
describe the equilibrium state by maximising entropy holding the expectation values 
of a much smaller number of possibly even efficiently obtainable conserved quantities 
hxed?” 

For many practically relevant locally interacting Hamiltonians, a number of 
conserved quantities can be identihed that are local in some sense. In fact, one of 
the possible dehnitions of quantum integrability (see also Section 19.2p is that there 
exists a number of conserved quantities scaling linearly in the system size. When such 
systems seemingly equilibrate to the time average state u under unitary dynamics, this 
time average can not be expected to be described by a Gibbs ensemble. The system 
may, however, still be reasonably expected to equilibrate to the i naxi mum entropy state 
given these suitably local constants of motion (see Refs. (3, bh?!. l258| and the references 
therein). SucI 
ensemble (GGE) (3^, 


33. 

42, 

64, 

112, 

259 

262 


More precisely, a conserved quantity is an observable A 6 0{'H) for which 
[H, A] = 0. Moreover, an operator A G 0{'H) is called local if it is only supported 
on some X G V, with |X| = K, for some constant K E N independent of the system 
size. 


Often, one considers conserved quantities that are approximately local. This notion 
can be made precise as follows. For simplicity, we restrict the attention to hnite 
dimensional spin systems. For a region X C V and I G Z+ denote by Xi the sets 


Xi:= {x eV: d{x,X) <l}, (79) 

of sites of the lattice that contain X as well as all sites within distance at most I from 
some site in X. For regions X C V dehne the map F^: H('H) —?■ B{'H) which acts on 
operators H G H('H) as 

Tx{A) := Trxc(H) G . (80) 

If supp(y4) C X, then Fx(R) = A, otherwise it die-cuts away everything of A that 
acts non-trivially outside of X, i.e., for any A G B{'H) and X C V it holds that 
supp(Fx(A)) C X. 

We call an operator {g, K)-local for some function g: Z+ —)■ M if there is some 
X C V of cardinality K such that 

\\A-rx,{A)\\^<\\A\\^g{l). (81) 

Often, the function g is taken to be exponentially decaying 

g{x) = Cl (82) 

for some suitable constants ci, C 2 > 0. We call operators A G B{'H) that are {g, X)-local 
in this sense approximately local (with exponential tails). Taking to be a step function 
gives the special case of (exactly) X-local operators. 
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Finally we define the notion of a quasi-local operator. Consider a translation 
invariant spin system, i.e., a system whose vertex set V corresponds to the sites of 
a regular lattice in a way such that its Hamiltonian H := Ylxe e invariant under 

a set of translations T^.: V —?■ V of the lattice, indexed by the element a: G V that 
is mapped to the hrst element of V, in the sense that for all x G V it holds that 
Tx{II) := Ylx&s ^Tj:(x) = H. Thereby, and slightly abusing notation, we have implicitly 
dehned the action of a translation T^. on o perators B{'H) in an obvious way. An operator 
A G H('H) is then called quasi-local j263| if it can be written in the form 

' ' XGV 

for some operator B G H('H) that is approximately local with exponential tails. 

We now give a precise dehnition of the generalised Gibbs ensemble in a way that 
seems useful in its own right: 

Definition 1 (Generalised Gibbs ensemble). Let K G TM and g\ TA M. be a suitably 
decaying function. Consider a system with locally interacting Hamiltonian H G 0{'H) 
and a setC of constants of motion that are either local, approximately local, or quasi-local 
(as defined above), or the eigenmode occupations if the Hamiltonian is quadratic. The 
generalised Gibbs ensemble (GGE) of the system for a given initial state p(0) G S{'H) 
is then defined as 


ctgge := argmax^e 5 (^) VA G C: Tr(A(T) = Tr(Ap(0))} . (84) 

Note that the resulting state is always of the form 

c^GGE oc e“ ^ ( 85 ) 

with generalised inverse temperatures (Lagrange multipliers) (3a ^ 1^, one for each 
conserved quantity A G C|v|. It is often clear what suitable sets of constants of motion 
are, for example in quenches to integrable systems. In situations in which th ese constants 


of motion are ambiguous, the GGE inherits the same ambiguity 264j - l267| . As pointed 


out in the previous subsection, if all constants of motion are taken into account, then 
all hnite dimensional syster ns eq uilibrate on average to the respective GGE if they 
equilibrate on average at all 254 1. 

There is a large body of evidence that integrable quantum models indeed equili brate 
to a suitable generalised Gibbs ensemble in this se nse . The wo rks Refs. 33|, l42|, Ill2| 


noticed the signihcance of the GGE early on. Refs. GJ, l268l. |269| discuss the behaviour 
of the one- and two-point correlation functions after a quench in various models, and 
it is found that the relaxation dynamics and equilibrium values can be well understood 
by mea ns o f a GGE. The validity of the generalised Gibbs ensemble is also studied 
in Ref. j26ll |. where in particular, a que nch of one-dimensional hard-core bosons in an 
optical lattice is considered, and in Ref. 270| for a spin-1/2 Heise nberg XXZ chain with 


an anisotropy for various initial states, followed up by Ref. (260|. Ref. 27l| develops a 
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general method based on Wick’s theorem that allows to show that the GGE correctly 
captures the equilibrium properties in certain quenches in solvable systems starting in 
thermal states, such as certain Ising chains, the Luttinger model, ID hard-core bosons, 
and XX spin chains, and explains how quasi-particle occupations suffice to construct 
the GGE in these systems. Ref. 272| follows up on this b y co nsidering situations 


with interacting pre-quench Hamiltonians. In contrast. Ref. 273| considers a quench 


from quadratic to inhnitely strongly interacting (hard core) bosons and obtains exact 
results on t he t ime evolution and shows that the equilibrium state is described by a 
GGE; Ref. 274| follows up on this by analytical investigating how a trap ping potential 
influences the equilibration dynamics and equilibrium properties. Ref. (275| analyses 
the GGE in the Lieb-Liniger model. Ends potentially observable differences between 
the GGE and the grand canonical ensemble, and highlights that the GGE can turn 
out t o be ill-dehned if the initial expectation values of conserved quantities diverge. 
Ref. 276| discusses an interaction quench in the Lieb-Liniger model where the GGE 


implementation is not well dehned and where also the idea of the representative state has 
been tested for the hrst time in a truly interacting model where no use of the conserved 
charges was made. Ref. j259| is generally concerned with integrable models. In this work, 
the concept of a “representative Hamiltonian eigenstate” is introduced and it is shown 
how to construct it efficiently by means of a generalised thermodynamic Bethe ansatz. 
For long times, the equilibrium values of local observables after a quench are given 
by this “representative Hamiltonian eigenstate”. A framework for geometric que nche s 
in int egrable inodels based on the algebraic Bethe ansatz is developed in Ref. 211 [. 
Refs. 264 l266l. |277| study the crucial question of which constants of motion need to 


be included in the GGE to make it correctly reproduce the post quench equilibrium 
state. For certain quenches in an XXZ model and attractive Lieb-Liniger model they 
End that even if all known local conserved quantities are included the GGE it still fails 
to reproduce the equilibrium expectation values of even some local observables. It is 
conceivable that this is an indication that the model mi ght h ave more (quasi-)local 
conserved quantities. Finding them all is a non-trivial task |278| . Trying to identify the 
physically most relevant observables. Ref. 267| proposes to rather try to End the best 
possible approximation to the dephased state uj with an ansatz of GGE form (|85|) with 
as few observables as possible and exempliEes that this allows for example to capture 
the dephased state of a locally interacting fermionic system much better than with the 
standard G GE co ntaining only mode occupations. 


In Ref. 


279| the ability of the GGE to capture scenarios with repeated quenches 




is explored. Ref. |63| collects evidence that the GGE can correctly capture the long 
time limit of the expectation values of local operators in certain integrable models. 
Ref. 60| investigates diEerences between the inEnite time time averaged equilibrium 


state of a translation invariant Enite system and the inEnite time limit of the state 
of the corresponding inEnite system and whether their properties are captured by the 
GGE and which role is played by local co nser ved quantities common to both the initial 
and Enal Hamiltonian of a quench. Ref. 280| proposes generalised form factors for the 
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analysis of correlation fnnctions in generalised Gibbs ensembles. 


6. Typicality 

We have np to now managed to avoid the introdnction of ensembles, or as one conld 
say not put any probabilities by hand. However, ensembles and averages with respect 
to certain postnlated probability distribntion do play important roles in statistical 
mechanics. In this section we review some argnments that can be nsed in the framework 
of pnre state statistical mechanics to jnstify their nse. These approaches to explain 
the applicability of statistical mechanics are based on the insight that nnder certain 
assnmptions most individnal instances of a sitnation lead to a behavionr that is very 
similarly to the average or typical behavionr. 


6.1. Typicality for uniformly random state vectors 


We begin by reviewing the most infln entia l articles on the snlbect in historic order, 


starting with the works of Schrodinger |281| and von Nenmann |3[. We will then state. 


Lpiec 


prove, and discnss a general typicality theorem for nniformly random qnantnm state 
vectors. We hnish this section with a discnssion of typicality in other ensembles and the 
most common objections against typicality argnments. 

The strategy behind jnstifications for the nse of ensembles is to argne that most 
states drawn according to some reasonable measnre from a set of physically reasonable 
states have approximately the same properties, so that for compntations it is practical 
to work with an average state. This average state can, for example, tnrn ont to be the 
state corresponding to a micro-canonical or canonical ensemble. 

The nse of snch typicality arguments in the fonndations of qnantnm statistical 
mechanics has a long histo ry. First considerations along these lines already appear 
in a work by Schrodinger j28ll| from 1927. After an introdnction into (first order) 
pertnrbation theory and a discnssion of resonance phenomena in qnantnm mechanics 
with a focns on energy exchange in weakly interacting systems he goes on discnssing 
what he calls a “statistical hypothesis’!^. He aims at describing the long time behavionr of 
weakly interacting systems hoping to find thermodynamic behavionr. More specifically, 
he considers two systems that each have a pair of energy levels with the same gap. 
The conpling between them that mixes the levels is assnmed to be weak. As his 
previons calcnlation had shown that the time averaged state depends on the initial 
state, he proposes to make an assnmption abont the initial energy level popnlations. 
His assnmption is that the popnlations of the levels are proportional to the prodncts 
of the degrees of degeneracy of the non-interacting levels. By introdncing an entropy 
like qnantity, he argnes that if one of the systems is snfficiently large, this implies that 
when popnlations of energy levels whose rednced states on the small system are almost 
identical are combined, then the combined popnlations satisfy a canonical distribntion. 


§ German original 281|: “Statistische Hypothese”. 
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By this, he effectively argues that initial states fulffling his “statistical hypothesis” have 
reduced states on the small system that are well described by thermal states. 

The concept of typicality is even more prominent in a n article by von Neumann 
from 1929. His work has been translated by Tumulka 13ll and reviewed and refined by 


Goldstein, Lebowitz, Mastrodonato, Tumulka, and Zanghi 132| . Von Neumann sets out 
to clarify “how it can be that the known thermodynamic methods of statistical mechanics 
enable one to make statements about imperfectly (e.g., only macroscopically) known 
systems that in most cases are correct .’{JJ] He goes on to describe that this means to clarify 
“first, how the strange, seemingly irreversible behaviour of entropy emerges, and second, 
why the statistical properties of the (fictitious) micro-canonical ensemble can be assumed 
for the imperfectly known (real) systems, and that these questions will be tackled 
with the methods o f qu antum mechanics.’!^ He further argues that the phase space 
of classical ^sterns 282| . a central object in Gibbs’ formulation of classical statistical 
mechanics (l|, should, in the context of quantum mechanics, be replaced by a system of 
mutually commuting macroscopic observables that approximate the true non-commuting 
quantum observables. Each sequence of eigenvalues of all macroscopic observables 
is associated with a phase cell, i.e., the subspace spanned by the state vectors that 
all give precisely these measurement outcomes for the macroscopic measurem ents, but 


which are macroscopically indistinguishable from each other. Following Ref. 132 


we 


denote the projector onto the phase cell characterised by the sequence u of macroscopic 
measurement outcomes by P^y. The approximation of the microscopic observables is to 
be taken coarse enough, such that, for example, the commuting macroscopic position 
and momentum observables do not get in conflict with Heisenberg’s uncertainty relation 
for the true microscopic position and momentum operators. One of von Neumann’s 
main results is his “quantum ergodic theorem”El. Essentially, he is a ble t o show the 


following (for details see the original article and Theorem 1 in Ref. 132j): Fix the 
dimensions rank(Pj,) of the phase cells, if they are all neither too small not too large, 
then for any fixed Hamiltonian without degeneracies and non-degenerate energy gaps 
(see also Section [3.2j) . most decompositions of the Hilbert space into phase cells with 
these dimensions have the property that, for all initial states and most times during 
the evolution, the evolving state of the system and a suitable micro-canonical state 
are approximately macroscopicall y ind istinguishable. This property is called “normal 
typicality” by the authors of Ref. 132| . The result can actually be slightly generalised 
(Theorems 2 and 3 in Ref. jl32 |) and von Neumann’s theorem can be reformulated 
into a statement about all initial states, all decompositions into phase cells, and most 


II German original [3|: “[...] wie es kommt, dass die bekannten thermodynamischen Methoden der 
statistischen Mechanik es ermoglichen, iiber mangelhaft (d.h. nur makroskopisch) bekannte Systeme 
meistens richtige Aussagen zu machen.” 

^ German original [^: “Insbesondere, wie erstens das eigentiimliche, irreversibel scheinende Verhalten 
der Entropy zustande kommt, und warum zweitens die statistischen Eigenschaften der (fiktiven) 
mikrokanonischen Gesamtheit fiir das mangelhaft bekannte (wirkliche) System unterstellt werden 
diirfen. Und zwar sollen diese Fragen mit den Mitteln der Quantenmechanik angegriffen werden.” 
German original Q: “Ergodensatz [...] in der neuen Mechanik”. 
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132 


concerns not the 


Hamiltonians |132| . 

It is worth noting that the notion of typicality in Refs, 
quantnm state (vector) but the set of macroscopic observables. The statement holds 
for most decompositions of the Hilbert space in phase cells (with certain properties), 
or most Hamiltonians, but for all initial states. In the following, typicality will mostly 
concern the quantum state (vector), i.e., we will encounter statements that hold for 
most (initial) state vectors. 

Typicality arguments feature prominently in the PhD thesis of Lloyd (see also 
Ref. (Slj). Essentially he shows that for any hxed observable, if quantum state vectors 
are drawn uniformly at random from a subspace of a Hilbert space (we will soon make 
this more precise), then the mean square deviation of the expectation value of the 
observable in such a random state from that in the corresponding micro-canonical state 
is inverse proportional to the dimension of the subspace. 

In a similar spirit, the concept of typicality is a cornerstone of the arguments in 
the book by Gemmer (69|. As a measure of typicality the authors propose the Hilbert 
space variance and derive bounds for the Hilbert space variance of various physically 
interesting quantities, ranging from expectation values of observables and distances of 
reduced states to entropies and purities. As in Ref. ^ and the present work, the aim 
is to use typicality to justify the methods of statistical mechanics and thermodyna mics . 

Many of the ideas of the works summarised above have la ter re appeared in Ref. j283| 


in which the term canonical typicality was coined. Ref . 283| is intended to be a 


clarihcation and extensio n of the work of Schrodinger |281| . which we discussed earlier, 
and remarks in his book 284| on statistical thermodynamics. After a translation of the 


classical proof of the canonical ensemble from the micro-canonical one to the quantum 
setting, the authors argue that the law of large numbers implies that if a state vector 
is drawn uniformly at random from a high dimensional subspace, its reduced state on 
a small subsystem will look similar to the reduced state of the micro-canonical state 
corresponding to that subspace. 

Before we go on, we must say more precisely what we mean by drawing a state 
vector uniformly at random from a subspace. Intuitively it should mean that any state 
from the subspace is as probable as any ot her. Mathematically t his is made precise in 
the notion of left/right invariant measures j285| . Haar’s theorem 2861 implies that for 
any hnite d there is a unique left and right invariant, countably additive, normalised 
measure on the unitary group U{d) j285| . We refer to this measure as the Haar measure 
on U{d) and denote it by pinaa.r\U {d)]. Left and right invariant means that for any 
unitary U & U{d) and any Borel sei AS <TU{d) 


Tn^r[U{d)]{AS) = frn^r[U{d)]{U^) = yin..r[U{d)]{^ U), 


( 86 ) 


where U AS and ASU are the left and right translates of In this sense, the Haar 
measure /iHaar[f^(d)] is the uniform measure on U{d). 

The Haar measure on the group of unitaries that map a (restricted) subspace 
nR C of dimension Sr into itself induces in a natural way a uniform measure 






















6 TYPICALITY 


46 


AiHaar[’Hij] Oil State vectois I'lp) G TLr. We call state vectors drawn according to this 
measure, and also pure quantum states drawn according to the natural induced 

measure, Haar random and write l"^) ~ /iHaar[‘H_R]. 

A practical way to obtain state vectors distributed according to this measure is to 
£x a basis (|j))j=i for the subspace TLr and then draw the real and imaginary part of 
dR complex numbers from normal distributions of mean zero and variance one. 

The state vector 


is then distributed according to he., |i/^) ~ /iHaar 

probability that an assertion Ad-^)) about a state vector \'il) 


[Hr] |287 


is true if 


(87) 

We denote the 

i>) ~ hHaarl'Hij] 


In the framework of measure theory 


285 


of the phenomenon of measure concentration 1151 Ill6| . In particular a result known 


typi cality can be seen as a consequence 


as Levy’s lern ma, has been used in Refs. 68|, Ill7| to obtain theorems in the spirit of 
Refs. 671 . l283| . but with stronger bounds on th e pro babilities to observe large deviations 


from the (micro)canonical ensemble. Refs. 68|, IllTf focused mainly on reduced states of 


small subsystems of states dr awn at random from high dimensional subspaces. Based on 
the same techniques, in Ref. 118| . similar results have been obtained for the expectation 
values of individual observables on the full system as well as their variances, and for sets 
of commuting observables, developing further ideas of Ref. concerning macroscopic 
measurements. 

Furthermore, an extension to the distinguishability under a restricted set of POVMs 
is possible. We summarise these results in a single theoren i, wh ich, however, is not 
optimal in terms of constants and scaling (compare Refs. 68|, Ill8| for details). 

Theorem 9 (Measure concentration for quantum state vectors). Let i? C M and let 
TLr ^TL be the subspace of the Hilbert space TL of a system with Hamiltonian H G 0{TL) 
that is spanned by the eigenstates of H to energies in R and let dR := dim('HR). Then 
for every e > D it holds that (i) for any operator A G B{TL) 

{A)n[Hm \>e) <2 ^ (gg) 


lb)' 




and (a) for any set A4 of POVMs 


lh>' 


■^MHaar [Hr] 


,n[H]{R)) > e) <2h{M) 


2 ^-CdReRhiMf 


where C = 1/(36 vr^) and 


h{M) ;= min(|UAl|,dim('Hsupp(M)))- 
is Theorem 2.2.2 from Ref. 


(89) 


(90) 


118| . 


Proof. Eq. 

equal to the second argument of the min in Eq. 
remember that then for all p, <7 G S{TL) 


We now prove Eq. fl89l) for h{M.) 
Let S ;= Uj^^gLJA^supp(M) and 


(p, (t) <V (p®, a^) 


(91) 
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Then Eq. (75) in Section VI.C of Ref. |68| yields the result. To finish the proof, note 
that Eq. (IT^ implies that for any p, a G 5(77) 

^ \M\ 

Dm (p, (t) ■= sup - V] I Tr(Mfc p) - Tr(Mfc (t)| ( 92 ) 

<1 |lV(Afrt-'IV(Mff)| (93) 

M£UM 

<^|UA7| sup |(M),-(M).|. (94) 

^ MGUA^ 

Together with Boole’s inequality this yields that for every a G 5(77) 


P {T^m 

aar [Hr] 

<1 - P 

aar [Hr] 


, > e) 


\M£UM 


P 

IP)~m aar [Wfl] 


U KV)i*xv.|-<v),l> 




E 


< > P 

MeUM 


26 \ 

^ \^M\) 

2e 

|UM| 

2e \ 


I(^)|PXPI (^)o-l ^ |u_A4| J ■ 


(95) 

(96) 

(97) 


The proof of the result for h{JYl) equal to the first argument of the min in Eq. (I90l) 
can then be finished by choosing a = n[77](72), using Eq. (|88l) . and the fact that for all 
M G UAA it holds that ||M||^ < 1. Disregarding a favorable factor of 2 and using the 
(highly non-optimal) bound |UA4| < |UA4p yields the unified result as stated in the 
theorem. □ 


A physically particularly relevant case is when supp(A7) is contained in some small 
subsystem 5 D supp(A7) and R = [E,E + A] is some energy interval. Then the 
theorem yields a probabilistic bound on the distance 'D(|'0)('0|'^, n'^[77]([i7, 77 + A])). 
If 1-0) ~ PHaar[77R] and the dimension cIr of the micro-canonical subspace TLr to the 
energies in the interval [77, 77 -|- A] fulfils 3> ds, then , □'^[77]([77, 77 -|- A])) 

is small with very high probability. That is, the reduced state on 5 of a random state 
from the subspace corresponding to the energy interval R is indistinguishable from the 
reduction of the corresponding micro-canonical state, with high probability. 

The same holds in the more general setting that one has access only to a sufficiently 
small number of measurements, which in total have a sufficiently small number of 
different outcomes. If the total number of different outcomes | U A41 is much smaller 
than the dimension of the subspace corresponding to the energy interval [77,77 -|- A], 
a random state from this subspace is with high probability indistinguishable from the 
micro-canonical state. 

For a family of Hamiltonians of locally interacting quantum systems with increasing 
system size, if A is kept fix and 77 is chosen such that R = [77,77 -|- A] is not 
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too close to the boundaries of the spectrum of the Hamiltonian, then typically 
grows exponentially with the system size |V|. For a locally interacting system with 
a macroscopic number of constituents one would thus need to be able to distinguish 
an astronomically large number of different measurement outcomes to have a realistic 
chance of distinguishing a random state from a micro-canonical state. 


Similar methods as those used above were employed in Ref. 107| to prove that 
for Haar random pure states from high dimensional subspaces the effective dimension 
(which we encountered in Section 13.21) with respect to a fixed Hamiltonian is of the 
order of the dimension of the subspace, with probability exponentially close to one. The 
result can be generalised to cert ain measures over states that are product with respect 
to a bipartition V = S U B 


118 


6.2. Typicality for other measures over quantum state vectors 

In addition to the Haar measure, other measures over quantum state vectors have been 
considered in the literature: This has been done in order to incorporate meaningful 
physical constraints into notions of typicality. Refs. 288 - 293| introduce the mean energy 
ensemble. Instead of the uniform measure on a subspace corresponding to some energy 
interval, the mean energy ensemble consists of random state vectors which have a fixed 
energy expectation value with respect to some given Hamiltonian H. Under certain 
conditions on the spectr um o f H it can be shown that the mean energy ensemble exhibits 
measure concentration 29l| . In addition to that, it is possi ble t o identify the typical 


reduced state of states drawn from the mean energy ensemble 29 1| . and it can be shown 
that under certain condi tion s states from the mean energy ensemble typically have a 
high effec tive dimension 


Ref. 


118 


294| considers an ensemble of quantum state vectors of the form given in 


Eq. fl87j) . in which the expansion coefficients Cj = (j|'0) have fixed modulus but random 
phases. Concentration results, similar in spirit to Theorem [9l can be shown for this 
ensemble that yield typicality whenever sufficiently many energy levels are populated. 


Ref. 295| extends the notion of typicality to the dynamics of systems. Similarly as 


in the mean energy ensemble, the authors define an ensemble of initial states that share 
the same expectation value with respect to some given observable and then investigate 
the time evolution of this expectation value under a Hamiltonian. The authors find 
dynamical typicality^ i.e., that states that initially give similar expectation values also 
typically lead to a similar dynamical evolution of these expectation values. 

Typicality can also be used to speed up numerical calculations. Instead of sampling 
over exponentially large sets of states, often drawing just a few r epres entatives can 


already be sufficient to estimate expectation values 296| . Ref. 297| for example 


introduces the concept of minimally entangled typical quantum states, which, given 
a Hamiltonian H and inverse temperature /?, constitute an ensemble of pure states 
whose average corresponds to the thermal state of H at inverse temperature (I. The 
ensemble can be used to more efficiently calculate for example thermal expectation 
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values of ob servables. A related approach, which has recently been put forward in 
Refs. [^ . [ 2 ^. |299| . is to investigate and exploit typicality in the context of so-called 
matrix product states. The effects of typicality allow for the numerical approximation 
of thermal ex pec tation valu es o f observables in situations where naive approaches 


T—I 


are infeasible m . In Ref. |300| a method for numerically checking the validity of 
the eigenstate thermalisation hypothesis (see Section I7.2p is proposed that exploits 
techniques to apply exponentials of operators to random pure states. Typicality ensures 
that only few such random states are needed to obtain conclusive results, thereby vastly 
reducing the computational cost. 


Typic ality arguments are sometimes dismissed for being “unphysicaf' 


301, soa 


Ref. 132| . for example, contains a very interesting review of the mostly negative 
reception of von Neumann’s quantum ergodic theorem (see also Section [6]). Whether the 
concept of typicality is really superior to other approaches towards the foundations of 
statistical mechanics and thermodynamics, such as ergodicity, the principle of maximum 
entropy, or postulating ensembles, is to some extent a matter of personal taste. However, 
especially with respect to the latter, typicality has at last one important advantage: 
Instead of simply postulating that a certain ensemble yields a reasonable description of 
a certain physical situation, typicality shows, in a mathematically very well-defined way, 
when and why details do not matter. If most states anyway exhibit the same or very 
similar properties, then this does provide a heuristic, but pretty convincing, argument in 
favour of the applicability of ensembles. It is hence an argument supporting a description 
of large systems with ensembles. 


7. Thermalisation 

Given the findings presented in the last sections a natural question to ask is; When do 
closed quantum systems in pure states that evolve unitarily not only equilibrate, but 
actually thermalise in the sense that under reasonable restrictions on the experimental 
capabilities they appear to be thermalised or in thermodynamic equilibrium? 

To make this question meaningful we will dehne the term thermalisation in this 
section. Then, in Section 17.21 and 17.31 we will discuss two general complementary 
approaches to explain and understand thermalisation in the framework of pure state 
quantum statistical mechanics in detail. The first approach is the so-called eigenstate 
thermalisation hypothesis (ETH), the second is based on a quantum version of the 
classical derivation of the canonical ensemble from the micro-canonical one, augmented 
with rigorous perturbation theory. The first approach is based mostly on assumptions 
on the eigenspaces/eigenstates of the Hamiltonian, while the second one instead 
requires stronger assumptions on the initial state. We then turn to a discussion 
of thermalisation in locally interacting translation invariant systems and a result 
concerning the equivalence of the canonical- and micro-canonical ensemble in Section fTTl 
It is possible to interpolate between ETH approach and that based on assumptions 
on the initial state to some extent. We say more on that and on alternative notions 
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of thermalisation in Section 17.51 We finish by surveying numerical investigations of 
thermalisation and analytical results concerning concrete model Hamiltonians or specific 
classes of systems in Section 17.61 

Throughout this section a focus will be put on subsystem thermalisation, i.e., the 
thermalisation of a small part (subsystem) of a large composite quantum system via the 
interaction with the rest of the system (bath). The whole composite system (subsystem 
and bath together) is thereby assumed to be in a pure state evolving according to the 
standard (Schr5dinger-)von-Neumann-equation under some Hamiltonian H. Let S' C V 
be the vertex set of the subsystem and B = that of the bath, then we will call 
the sum Hs + Hb —■ Hq oi the two restricted Hamiltonians Hs and Hb the non¬ 
interacting Hamiltonian and Hj := H — Hq the interaction Hamiltonian. We will say 
that a Hamiltonian H is non-interacting ii H = Hs + Hb- 

Whenever the term bath is used in the following it refers to this model of 
thermalisation. In particular we do not mean quantum systems that are already initially 
in a thermal state or other models of heat baths. It is crucial to note that approaches that 
explain thermalisation in quantum systems by investigating the behaviour of systems 
coupled to such thermal baths cannot solve the fundamental problem of thermalisation, 
as they leave open the question how the thermal bath became thermal in the first place. 

7.1. What is thermalisation? 

Whenever a term from one theory is used in a different context, a proper definition 
is mandatory. This is particularly true for terms as involved as thermalisation and 
thermodynamic equilibrium which, already in classical statistical mechanics, have several 
different meanings depending on the context. To take account of the complex nature 
of the term thermalisation we will not jump directly to a definition. Instead, we will 
consider a number of conditions that each capture certain aspects of thermalisation and 
whose fulhlment, depending on the context, one might or might not find necessary to 
say that a system has thermalised. 

The catalog of properties that we will consider has been chosen with the setting 
of subsystem thermalisation in mind. Based on this discussion we will then carefully 
define what we consider sufficient to call a (sub)system thermalised, leaving open the 
possibility of defining other, possibly less strict, notions of thermalisation. In addition 
to that, we will also define the term subsystem initial state independence, a property 
that we regard as a necessary prerequisite for the thermalisation of subsystems, and 
which we will discuss in more detail in Section [HI 

The aspects of thermalisation that we will use as a guideline for our definition of 
thermalisation are: 

(i) Equilibration-. Equilibration is generally considered to be a necessary condition 
for thermalisation. In the following we will mostly be concerned with subsystem 
equilibration on average and apparent equilibration on average of the whole system 
under restricted sets of POVMs (see also Section I3T]) . 
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Subsystem initial state independence: The equilibrium state of a small subsystem 
should be independent of the initial state of that subsystem. If a system exhibits 
some local exactly conserved quantities then one might still call it thermal and 
desc ribe its eq uilibrium state by, for example, a so-called generalised Gibbs ensemble 
42, 303, 13n4j |. However, even such a behaviour is often already considered to be 
non-thermal. We will take the more cautious point of view that a system should 
not be considered thermalising if its equilibrium state depends on details of its own 
initial state, despite the absence of local exactly conserved quantities. 


(iii) Bath state independence: It is generally expected that the equilibrium expectation 
values of local observables on a small subsystem are almost independent of the 
details of the initial state of the rest of the system, but should rather only depend 
on its “macroscopic properties”, such as the energy density, which one would expect 
to have an influence on the temperature of the thermalising subsystem. 


(iv) Diagonal form of the subsystem eguilibrium state: The equilibrium state of a small 
subsystem should be approximately diagonal in the energy eigenbasis of a suitably 
defined “self-Hamiltonian”. If the interaction with the bath makes the state of the 
subsystem approximately diagonal in some basis one could call this decoherence. 

(v) Gibbs state: Ultimately, one would like to recover the standard assumption of 
(classical) statistical physics that the equilibrium state is in some sense close to a 
Gibbs/thermal state. 


In the light of Condition ([i]) it seems to be a sensible approach to define 
thermalisation on average or during an interval depending on the type of equilibration 
that goes along with thermalisation. Conditions ([n]) and (jm)) make clear that 
thermalisation should be defined with respect to sets of initial states. This leads us 
to the following definition of thermalisation: 


Definition 2 (Thermalisation on average). We say that a system with Hilbert space H 
and Hamiltonian H G 0{'H) thermalises on average with respect to a set A4 of POVMs 
and for a given set of initial states Sq C S{'H) if for each state p(0) G iSq, the system 
apparently eguilibrates on average to an eguilibrium state u = $h{p{0)) that is close 
to a thermal state g[H](^/3(TT{H p(0)))) for some Hamiltonian H in the sense that for 
some suitable function /5: M —)■ R the distinguishability g[H][(d(fTT{H p(0))))) is 

sufficiently small. 

Definition [2] implicitly also defines thermalisation on average of subsystems. Just 
choose AA to be the set of all POVMs with support on a subsystem S' C V and H = Hs. 
If on the contrary AA. contains POVMs whose support covers the whole system, then 
H = H is a natural choice. Moreover, in practice one would probably want that the 
function (3 has some physically nice properties, like being smooth and or monotonically 
decreasing. Thermalisation during intervals can be defined equivalently, but as we will 
not discuss it here, we keep the definition as simple as possible. 

It seems worth emphasising again that the above definition does not say that a 
system thermalises if and only if the given conditions are met, but only says we call it 
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thermalising if at least the given conditions are met. It gives a set of conditions that are 
sufficient for thermalisation. In addition it shall be noted that for the case of subsystem 
thermalisation with a small subsystem our definition of thermalisation implies that the 
equilibrium state of the subsystem must be nearly independent of the subsystems initial 
state. We discuss subsystem initial state independence in more detail in Section ISTTl 

An obvious question to ask now is: What are reasonable sets Sq of initial states? 
Particularly important is the energy distribution of the initial states, i.e., the sequence 
{pk)k=i of fho energy level populations pk '■= Tr(nA; p(0)), as it is conserved under time 
evolution. Taking the classical derivation of the canonical ensemble from the micro- 
canonical one as a guideline, thermalisation can only be expected to happen for initial 
states whose energy distribution is not too wide, i.e., the energies of the significantly 
populated levels must be in an interval small compared to ||Tf||oo. We will see in 
Sections 17.21 and 17.31 that such a condition will play an important role in proofs of 
thermalisation. 

In the above definition of thermalisation on average we deliberately left open the 
question of what “sufficiently small” means. This is ultimately to be decided in the 
specific situation at hand. One would probably want that g[H]{/3)) somehow 

suitably decreases with the size of the system. However, we want to have a definition 
of thermalisation that is applicable to finite systems. Moreover, we want to avoid the 
technicalities of defining thermalisation for sequences of quantum systems of increasing 
size. 


1.2. Thermalisation under assumptions on the eigenstates 


At the center of the first approach to show thermalisation in quantum systems is the 
eigenstate thermalisation hypothesis (ETH). There exist various version of the ETH in 
the literature and we will give a more precise definition below, but a minimal version 
of the ETH can informally be phrased as follows: “A Hamiltonian fulfils the ETH 
if the expectation values of physically relevant observables in its energy eigenstates 
are approximately smooth functions of their energy.” As we will see in this section, 
observables for which a system fulfils the ETH thermalise on average under re asonable 
conditions. The ETH is usually said to date back to the two works 189 . IdOhl . As the 
role of these works is, however, often misunderstood it is worth starting this section 
with a short historical review: 


Already in 1985 Ref. ^ investigated numerically how relatively small quantum 
systems equilibrate to a state that can be well described by statistical mechanics. The 
computational power available at that time made it possible to study a spin-1/2 Ising 
chain with up to seven sites in a transverse field by means of exact diagonalisation. 
Ref. investigates the equilibration behaviour of both global and local observables 
and compares time averages with micro-canonical and canonical averages. The authors 
conclude that “both integrable and non-integrable quantum systems with as few as seven 
degrees of freedom can exhibit statistical behaviour for finite times.” They also describe 
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the reason for the statistical behaviour they observe, which is essentially the mechanism 
that is today known as the ETH; “If the expectation values [of an observable in the 
energy eigenstates] are smooth functions of the energy [...], then the short-time average 
of the observable will be very close to the ensemble average.” In fact, it seems fair to 
say that the authors did anticipate large parts of the recent debate on equilibration 
and thermalisation in closed quantum systems. The last sentence of the abstract for 
example reads “This work clarifies the impact of integrability and conservation laws on 
statistical behaviour. The relation to quantum chaos is also discussed.” It is remarkable 
that Ref. ^ is nevertheless essentially completely igno red by almost the whole recent 


literature ce ntred around such questions (Refs. |4fil . I3n6| being notable exceptions) 


In Ref. 


305| a mechanism that can lead to the thermalisation of quantum systems 


is identified, which the author calls eigenstate thermalisation. A quantum and a classical 
version of a hard sphere gas serve as prototypical examples to illustrate this mechanism. 
A central role is played by Berry’s conjeeture. It states that in certain quantum systems, 
whose classical counterparts exhibit classical chaos, the energy eigenstates to energies 
in the bulk of the spectrum ar e sup erpositions of plain waves with random phases and 
random Gaussian amplitudes 3071 ]. It is argued that in the hard sphere gas, whose 


classical version is indeed chaotic, all energy eigenstates that satisfy Berry’s conjecture 
have a single particle momentum distribution that is thermal. Finally, thermalisation is 
explained by the accumulation of relative phases between energy eigenstates due to time 
evolution. This de-phasing destroys any fine tuned setting of the phases that might have 
been present in the coherent superposition of energy eigenstates that made up the initial 
state. Su ch a fine tuning is necessary to get an initial state that is out of equilibrium. 

Ref. jl89| aims at providing a quantum mechanical justification for the applicability 
of statistical ensembles. The main idea is to model interacting composite quantum 
systems by starting with a non-interacting Hamiltonian that can be well understood, and 
then modelling generic effects of the interactions by addin g a small r andom Hamiltonian 
— very much in the spirit of random matrix theory 186l - ll88l |. Due to the fact 
that composite quantum systems generically have exponentially dense spectra, i.e., 
either exponentially small gaps between neighbouring eigenvalues and/or exponentially 
large degenerate eigenspaces, any extremely small perturbation will typically mix an 
exponentially large number of energy eigenstates of the non-interacting Hamiltonian. 
This smears out their individual properties and should make the expectation values 
of physical observables in individual energy eigenstates of the perturbed Hamiltonian 
similar to those in a micro-canonical state with a similar mean energy. 

A much mor e rig orous formulat ion o f the idea behind eigenstate thermalisation can 
be found in Ref. 105| (see also Ref. 308j). This article considers bipartite systems with 


V = SiJB, whose non-interacting part Hq = Hs+Hb of the Hamiltonian H = Hq-\-Hj is 
non-degenerate. The interaction Hamiltonian Hj is assumed to couple only neighbouring 
energy levels, i.e., it is of the form 


Vfceld]: = 


(98) 
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for some A G M such that -C A with the maximal spacing between the 

energy eigenvalues of Hb and e™™ the minimal level spacing of Hs- It is hrst argued 
heuristically and then proved, under some additional technical assumptions, that such 
Hamiltonians indeed exhibit eigenstate thermalisation in the sense that for most k and 
all observables As with supply) C S' it holds that {Ek\As\Ek) ~ Tt{As g[Hs]{f3{Ek))) 
(see Eq. (5) and (6) in Ref. 308[). 

The eigenstate thermalisation hypothesis (ETH) gained wide popularity after the 
very influential article Ref. j^, which states the ETH as follows: 

Conjecture 1 (Eigenstate thermalisation hypothesis as stated in Ref. 


33|). The 


expectation value {Ek\A\Ek) of a few-body observable A in an eigenvector \E)k of the 
Hamiltonian, with energy Ej., of a large interacting many-body system eguals the thermal 
[■ ■ ■] average of A at the mean energy E^. 

It is emphasised that thermal average in this context can also mean the micro- 


canonical average. Ref. 33j studies a system of hard core bosons on a lattice. It is 


demonstrated that the observed thermalisation can be explained by the fact that certain 
physically relevant observables have expectation values in most energy eigenstates that 
indeed resemble those in a micro-canonical state. The validity of numerous variants of 
the ETH has been studied extensively and in great detail at hand of many physically 
relevant models. This will be detailed in Section 17.61 

A slightly generalised and sharpened version of the ETH that captures the spirit of 
eigenstate thermalisation and applies to degenerate Hamiltonians is the following: 

Definition 3 (Eigenstate thermalisation hypothesis (ETH)). A Hamiltonian H fulfils 
the eigenstate thermalisation hypothesis in a set R C M o/ energies with respect to a 
set j\4 of POVMs if and only if all its spectral projectors H^. to energies E^ & R have 
the property that there is a sufficiently smooth function /3: R —?• M"*" such that for each 
k with Ek & R it holds that for all normalised pure states G S{'H) with the property 
f) < Hk the distinguishability T)Mi'f ’1 g[H]{ld{Ek))) is sufficiently small. 

Again, we have deliberately left open what is meant by “sufficiently smooth” and 
“sufficiently small”. 

It is still open under which precise con ditions the ETH holds in this or a similar 

have so far not been generalised to more 


105 


form. The rigorous derivations of Ref. 
reasonable physical interactions. Methods to analytically check the ETH in “non- 
integrable models” that are interesting in the context of c ondensed matter theory 


currently seem to be out of reach. Very recently in Ref. 309| a statement remi niscent 
of the ETH was proved under fairly general conditions. More precisely. Ref. jdOoll shows 
weak local diagonality (Theorems 4 and 38) of the energy eigenstates of a certain type 
of Hamiltonian. In the language used here a slightly simplihed version of this statement 
can be formulated as follows: 


Theorem 10 (Weak local diagonality 309|). Consider a locally interacting spin system 
with Hilbert space Ti and Hamiltonian H G 0{'H) whose interaction graph iV^S) is a 
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hypercuhic lattice of spatial dimension D and let S G B G V be subsystems. Then there 
exist constants C,c,v > 0, which depend only on D and the local interaction strength 
J := maxxef ll-^x|loo Hamiltonian such that for any energy eigenstate \E) of H 

to energy E there exists a state G S{'Hb) that satisfies for any two energy eigenstates 
\EP), \E^) of Hbib with energies Ej^ and E^ 

\{EP\p^\E^)\ < ( 99 ) 


and at the same time 




y / 


g-c diS,B-)/2 


( 100 ) 


Essentially the theorem tells us that if S is sufficiently far from the boundary of 
B, then for each energy eigenvector \E) of H there exists a state in S{'Hb) that is both 
approximately diagonal in the eigenbasis of Hb\b and locally on S hard to distinguish 
from \E){E\. If one could improve the result to the effect that it would show local 
indistinguishability not only from an approximately diagonal state but from a thermal 
state then it would constitute a proof of an ETH like statement. Ho weve r, such a 
generalisation can almost surely hold only under additional assumptions 

The ETH, as defined in Definition [3], is sufficient for thermalisation in the following 
sense: 


Observation 2 (Thermalisation in systems that fulfil the ETH). Systems whose 
Hamiltonian H G 0{'H) fulfils the ETH, as stated in Definition\B, for a set R G M. 
of energies with respect to a set M of POVMs, thermalise on average with respect 
to the set Ai, in the sense of Definition\^ for all initial states for which the system 
apparently equilibrates on average with respect the restricted set M of POVMs (see also 
Section \3. 1\) and whose energy distribution is sufficiently narrow and contained in R, 
i.e., Ek ^ R Tr(nfcp(0)) = 0. 


The fact that the ETH is sufficient for therm alisation in this or a similar sense is 
widely known (see, for example. Refs. 1051 . Il74| ). It is worth noting that the strong 
requirement in Definition [3] that the distinguishability g[H]{/3{Ek))) must be 

small for all normalised pure states f < H^ is crucial for the above observation to 
hold. At the same time, this requirement obviously becomes harder to satisfy the more 
degenerate the Hamiltonian is. 

If one takes the point of view that one should say that a system thermalises only 
if it thermalises in the sense of Definition [2] for all equilibrating initial states with 
a sufficiently narrow energy distribution, then fulfilment of the ETH is at the same 
time essentially also necessary for thermalisation. We will not make this statement fully 
rigorous, but the intuition behind it is as follows: If the ETH is not fulfilled, there should 
always exist initial states with a narrow energy distribution that only have overlap with 
energy levels that, for certain observables or POVMs, produce a measurement statistic 
that is sufficiently far from that of the closest thermal state. This distinguishability 
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from the thermal state will then still be visible in the de-phased state and hence survive 
de-phasing and equilibration. 

Such arguments, and the above mentioned apparent connection between the ETH 
and (non-)integrabihty, has lead some authors to proclaim H 0 0, H H, 03 
that non-integrable systems thermalise and integrable systems do not. While there is 
evidence that in many models this is indeed the case, we will see in Section [8] and |9] that 
the situation is in fact more involved. We will given an overview of the numerical and 
analytical literature on thermalisation in the context of the ETH in Section 17.61 

We have seen that the ETH as dehned in Dehnition [3] is by construction essentially 
sufficient and, in a certain sense, necessary for thermalisation. The necessary part, 
however, only holds if one is willing to call a system thermalising only if it thermalises for 
a given set of POVMs for all initial states with a sufficiently narrow energy distribution 
for which it also apparently equilibrates. Hence, there is the possibility to show 
thermalisation in systems that do not fulhl the ETH, if one is willing to restrict the 
class of allowed initial states. As we will see in the following this can indeed be done. 


7.3. Subsystem thermalisation under assumptions on the initial state 

In this section we discuss a second approach towards the problem of thermalisation 
that is independent of the eigenstate thermalisation hypothesis (ETH). Instead of 
making strong assumptions concerning the properties of the energy eigenstates of the 
Hamiltonian we will show thermalisation under stronger assumptions concerning the 
energy distribution of the initial state. This alternative and complementing approach 
is inspired by an argument from classical statistical mechanics, which we will lift to the 


311 


quantum setting. The details of this approach were hrst worked out in Ref. 

The hrst motivation for this work comes from the fact that explaining thermalisation 
by using the eigenstate thermalisation hypothesis has one important drawback — that 
the ETH is indeed a hypothesis. One could make the provocative claim that this leads 
to the ironic situation that attempts to explain thermalisation by the ETH have the 
following problem; They essentially try to explain one phen omen on that is not well 
understood by another one that is almost as little understood 312| . 

The second motivation comes from the consideration that demanding thermalisa¬ 
tion of all initial states with an energy distribution that is only required to be narrow 
but otherwise allowed to have arbitrary complex structure is asking for too much. 

In the light of typicality arguments (Section [6]) it seems plausible to restrict the 
class of initial states for which one tries to show thermalisation, or even to be content 
with an argument that shows thermalisation for most states from some measure. In 
addition, certain restrictions on the initial states are anyway already necessary to prove 
equilibration on average in the hrst place (Section 13.2p . and practical limits on the 
experimental capabilities can be used to arg ue that inany initial states of macroscopic 
objects are essentially impossible to prepare 106 . 108 . H i. 

The third motivation conies from the known fact that in some systems the ETH 
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Kinematic 


Dynamic 


Figure 5. (R eproduction from Ref. |l20l |') Structure of the proof of thermalisation 


from Ref. 311 1. 


is not fulfilled and this has been linked to the integrability of these models, while non- 
integrability is often associated with a fulhlment of the ETH and thermalisation (see for 


example Refs. 33l . \A‘ A l55l .l63l l84j .Il78lIl84j.l22ll.l26lll27ll. l313 |). What (non-)integrability 


even mea ns in the context of quantum mechanics is, however, far from being settled 
3li, Eli (see also Section [9]). It is thus of interest to approach the problem of 


thermalisation in a way that is independent of the concept of integrability. 

As we will see in the following, restricting the class of initial states makes it possible 
to rigorously prove subsystem thermalisation on average without any reference to the 
ETH for both spin and fermionic systems. The overall structure of the argument 
is depicted in Figure [5l The result that we will derive and discuss in this section 
can be combined with either the typicality theorems from Section [6] or the dynamical 
equilibration theorems from Section [3l The former yields a kinematic thermalisation 
statement (Observation 0]) that holds for most Haar random states from a certain 
subspace. The latter yields a dynamic thermalisation result (Observation [5]) that proves 
thermalisation on average in the sense of Definition |2] for all initial states from a certain 
class of states. It is hence closer to the thermalisation statement obtained under the 
ETH (Observation [2]), which we discussed in the last section. 

In essence, the proofs of the statements presented in this section are translations of 
the classical derivation of the canonical ensemble for small subsystems of large weakly 
interacting systems that are described by a micro-canonical ensemble to the quantum 
setting. The main difficulty is that in quantum mechanics the interaction between the 
small subsystem and the bath not only shifts the eigenenergies of the non-interacting 
Hamiltonian, but, in addition, signihcantly perturbs the energy eigenstates. In many 
previous accounts of the thermalisation problem this issue has been partial ly overloo ked 


68 




or at least not been addressed rigorously. Compare for example Refs. 

How does the interaction influence the Hamiltonian? The eigenvalues of the 
interacting Hamiltonian are shifted at most by the operator norm of the interaction 


Hamiltonian with respect to those of the non-interacting Hamiltonian |88l . Theorem 
HI.2.1]. As long as the interaction is weak, in the sense that its operator norm is 
small compared to an energy uncertainty or measurement resolution, the change in the 
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eigenvalues will thus be insignificant. 

The energy eigenstates, or in the case of a degenerate Hamiltonian the spec tral 
projectors, are much more fragile. Naive perturbation theory breaks down j316| as 
soon as the strength of the perturbation is larger than the gaps of the non-interacting 
Hamiltonian. The gaps of a locally interacting quantum system are, however, usually 
exponentially small in the system size. Indeed, if the non-interacting Hamiltonian Hq 
and the interaction Hamiltonian Hj are not diagonal in the same basis, the energy 
eigenstates of H = Hq + Hj will usually be markedly different from those of Ho. 

Before we tackle this problem, let us consider the non-interacting case, i.e., a 
Hamiltonian of the form Hq := Hs + Hb- Let Ho and Hsis have spectral decompositions 
Ho = Hs\s = respectively. Moreover, let and 

i\^m))rn=i ^*6 some orthouormal eigenbases with corresponding eigenvalues and 

of Hsis cind Hb\b, respectively. The Hamiltonians Hs^s^ Hb^b, and Ho are 
allowed to have degeneracies, i.e., I ^ I' =/^ Ef ^ E^ and m ^ m' =jh- E^ ^ E^, 
and the bases are not unique. Remember that, on the other hand, by definition, the 
elements of the sequences and {Ef)f^^ are distinct. 

We first look at the case of spin systems. In such systems each of the spectral 
projectors H^ of Ho is of the form 

n!:= E \Ef){Ef\tS\E^){E^\. (101) 

/,m: Ef+EB=E0 

The micro-canonical state n[Ho]{[E, -|- A]) to an energy interval [E, -|- A] is hence 

proportional to 

nlff,l(lB,i5+A])« (102) 

k: E°G[E,E+A] l,m: Ef+EB=E0 

Its reduced state n'^[i7o]([-E, E + A]) = Tr^ n[Ho]{[E, -|- A]) on S' therefore satisfies 

n“lff„l(lB,i5 + A])« ^ 5^ |Bf)(Ef| (103) 

k: E°G[E,E+A] l,m: Ef+EB=E° 

= E E,\Ef){Ef\\{m: E;^ + E^. = E»}\ (104) 

k: E'^e[E,E+A] l=l 

d's 

= H EnfK™: + s® = b2}| (105) 

k: E°e[E,E+A] l=l 

d'g 

= ^nf|{m:i5f + E®6|i5,B + A|}| (106) 

1=1 

d'g 

= 'E^f*4BBw](E-Ef), 

1=1 


( 107 ) 
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where 

#^[IlBrB](J^) ■■= \{m: Eg e [E,E + A]}\ = Te,nk{A[HB^B]{[E,E + A])) (108) 

is the number of orthonormal energy eigenstates of the bath Hamiltonian Hb to energies 
in the interval [E, + A], 

For systems of fermions Eq. fllOip does not hold, because the Hilbert space of the 
joint system is not the tensor product of the Hilbert spaces of the subsystems. However, 
the following quite lengthy calculation shows an equivalent result also for fermionic 
systems. Readers not interested in the details can safely jump directly to Observation [3l 

Denote by fx,fx fhe fermionic annihilation and creation operators on "H and by 
fx, fl with X E S those acting on l-is and tor x E B those acting on I-Lb- Furthermore, 
denote the vacuum state vector in Tf by |0) and the projectors in H('H) onto the subspace 
with no particle in system S' or R by |0)(0|5, and |0)(0 |b, respectively. The projectors 
|0)(0|5, |0)(0 |b, and |0)(0| are all even operators and |0)(0| = |0)(0|5 |0)(0 |b. For each 
I E [ds] let be the representation of the eigenstate |Rf) as a polynomial in the 
fermionic operators on Bs, i.e., |Rf) = pf^®((/s, fl)ses) |0)s, and likewise for Note 
that the pg^ and the pgf^ are either even or odd polynomials as otherwise the projectors 
|Rf)(Rf| and \Eg){Eg\ would not be even. Furthermore, note that commuting two 
polynomials that are both either even or odd gives a global minus sign only if both 
polynomials are odd. As Rg and EIb are even operators it is straight forward to verify 
that the states \EJ + Eg) := pf®((/s,/]) sgs) p^®((/6,/fe)6GB) |0) are eigenstates of R° 
to energy Ef + Eg. In fact, they form an orthonormal basis of EL in which R°, Rg, 
and Hb are jointly diagonal. For the sake of brevity we omit the subscripts ses and 
bgs in the following calculation. It is again straight forward to verify that for any even 
operator A E BifH) with supp(A) C S' it holds that 

IV (A |Bf + + £)»l) 

= Tr (Apfmf,, /;))?"'((*, /*)) lOXOIs |0X0|bp"«((A. A*))tpF((/.. /.))*) (109) 

= iv(/ip"n(/„/t))|0X0|sp"n(/../]))*P.7((A.k))|0X0|sP."*((A.k))*) (110) 

= lV(/l|Bf)(£)f|). (Ill) 

The last step can be shown by explicitly writing out the trace in the Fock basis and 
inserting an identity between the operators that are supported on S and those supported 
on B. 

Now, note that any operator A E B{'H) with supp(A) C S can be written as a sum 
of an even and odd part and that only the even part can contribute to an expectation 
value of the form Tr(A \Ef + Eg){Ef + Eg\). The above calculation is hence sufficient 
to show that (remember the dehnition of the partial trace in Eq. (IM|) 1 

W e lds],m 6 Ids]: TVsdEf + E^){E^ + B®]) = l^fX^f 


( 112 ) 
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Finally, realizing that 

n|i/„](|B,£: +A|) 

= E E + E^){Ef + E^\) (113) 

k: E°£[E,E+A] l,m,: Ef+EB=E° 

yields an expression equivalent to Eq. (I102j) and the proof then proceeds analogously. 
We summarise the result of the above calculation in the following observation: 

Observation 3 (Gibbs states as reductions of micro-canonical states of the 
non-interacting Hamiltonians). Let [E, E + A] be an energy interval and Hq = Hg + Hb 
a non-interacting Hamiltonian of a bipartite quantum system of spins or fermions with 
V = S L) B. If for some /? G M holds that 




(114) 


then n‘^[i7o]([-^) E + A]) takes the well known form of a thermal state, i.e., 


d'a 


+ A]) « ^nf « 9[ffs,sl(/3) = /[ffol(/3). (115) 


1=1 


Note how /9, which was introduced in Eq. (I114|) simply as a parameter describing 
the shape of the number of states, ends up being the inverse temperature of the thermal 
state g[Hs\s]{/3) of the subsystem S. Similar calc ulations (at least for spin systems) can 
be found for example in Refs. 69|, If 051. l283l. l294| and in many textbooks on statistical 
mechanics. 

For finite dimensional baths the proportionality ifA[IlB\B]{E) cx can nev er be 


exactly fulfilled simply because # a [Hb fs] (E) is not continuous. A detailed analysis |311 


Appendix A] shows that if the logarithm of the number of states ln{^/^[HBiB]{E)) can be 
sufficiently well approximated by a twice differentiable function whose second derivative 
is small compared to the width of the relevant energy range [E — , E ||hfs||oo], 

then Eq. (I115j) is fulfilled approximately. The first derivative of this approximation ends 
up being the inverse temperature of the thermal state, the second derivative enters the 
error bound. 

It is widely known that natural locally interacting Hamiltonians H with bounded 
local terms “generically” have an approximately Gaussian number o 
if the system size is sufficiently large 69|, Section 12.2] (see also Ref. 


states 


mE) 

3171 . l318| for some 



rigorous results). It is more common to refer to the density of states in this case, which 
is essentially the limit of ^a[H]{E)/A for A small and increasing system size. If the 
bath Hamiltonian Hb is taken to be such a model with a nearly Gaussian density and 
number of states, the approximation by a twice differentiable f uncti on is possible and 
the distance V{\1^[Hq]{[E, E A]), g[Hs\s]{/d)) can be bounded 311 . Appendix B] and 
is usually exponentially small in the size of the bath. In the following we will call locally 
interacting systems that have this property “generic”. 
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The value of f3 for which V{\1^[Hq]{[E, E + A]), g[Hs^s]i.l^)) is small depends on E. 
If is indeed close to a Gaussian, then can be well approximated 

by an inverted parabola. Its hrst derivative, which is essentially the optimal /9, is large 
for low values of E, thus associating them with low temperatures. For values of E in 
the center of the spectrum it goes to zero, corresponding to inhnite temperature, and 
becomes negative for even higher values of E. 

In conclusion, we can say that the reduction on S' of a micro-canonical state to an 
energy interval [E, E + /A\ oi a, system that is a composite system with V = S (J B and 
without any interaction between S and B, whose Hamiltonian EIb on i? is a “generic” 
many-body Hamiltonian, will typically be exponentially close to a Gibbs state of Hs 
with an inverse temperature (3 that depends in a reasonable way on E. This works for 
all values of E that are neither too low nor too high. At the edges of the spectrum the 
number of states of the bath will be too low to allow for a good approximation of the 
number of states by a twice differentiable function. In addition, A must be both small 
compared to ||hf||oo and large compared to the largest gaps in the spectrum of H in the 
relevant energy range. 

Now we consider the influence of an interaction between S and B. The challenge 
posed by the fact that such an interaction will typically markedly perturb the energy 
eigenstates can be overc ome by a perturbation theorem based on a result of Ref. 

(see also Refs. 


319 . l32n| ) for projectors that are sums of spectral projectors. 


The orem 11 (Stability of sums of spectral projectors (implied by Theorem 1 of 
Ref. 31l|)). Given an energy interval [E, E + A] and two Hamiltonians if, H' G 0{'H) 
with spectral decompositions H = and H' = 'Yhk^'k^'k- ^ P' 

projectors that are sums of the spectral projectors H^ and H'^ to energies in [E, i? -|- A] 
of H and H', respectively, i.e., 




and 


P' E ni. 

k: E'^^&[E,E+A] 


(116) 


Then for every e > 0 


11^_ii9| 

\P — -Pill < (rank(P) -|- rank(P'))- — -|- rank(Pe) -|- rank(Pg') (117) 


where 


Pe ■■= 

k: Ek£lE,E+e]U[E+A-e,E+A] 


(118) 


and 


Pi := 


Hi 


E 

k: E'^,e[E,E+e]U[E+A-e,E+A] 


(119) 


The rather technical theorem stated above has immediate consequences for the 
stability of micro-canonical states: 
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Corollary 1 (Stability of micro-canonical states 0). Given an energy interval 
[E, E + A] and two Hamiltonians H, H' G 0{'H) with spectral decompositions H = 
Y., Et and H' = Yt K it holds that for every e > 0 

H'\ 


V {n[H]{[E, E + A]), n[H']{[E, E + A])) < 


H 


+ 


An + n^ 

90 


( 120 ) 


where flmin/max := min / niax (rank(n[i7]([ii^, -|- A])), rank(n[i7']([ii^, -|- A]))), O := 


f^max and 


n, :=Tank{n[H]{[E,E + e\ U [E + A - e, E + A])) 

+ Tank{n[H']{[E,E + e] U [E + A - e, E + A])). 

Proof. By the triangle inequality 

|P- P'lli + AO 


V (n[P]([P,P + A]), n[P']([P,P + A])) < 


20 


with P, P' dehned as in Eq. flll6p . Theorem [TT] hnishes the proof. 


( 121 ) 


( 122 ) 

□ 


What is the meaning of the corollary? The statement is non-trivial if ||P — P'||^ 

A. Then one can expect that there exists an e with the property that ||P — P'|| oo « 
e <C A, such that both ||P — P'||^/e <C 1 and (AO -|- 0£)/(2 0max) "C 1. Under the 
assumption of an approximately uniform density of states one hnds that (2 Umax) ~ 
2e/A and A0/(2 0max) ~ ll-^~-^1loo/^ such that the optimal choice for e is 
approximately e ~ ^y\\H — H'\\^ A/2, which yields 

1/2 


B(n|J/](/),n[J/'](/))<4 


\H - H’ 
A“ 


(123) 


While the above example provides some intuition for how powerful Theorem [TT] and 
Corollary [1] are, the case of a uniform density of states is not the relevant situation if 
one is interested in showing thermalisation. As we have seen in the beginning of this 
section, for n^[HQ]{[E, E + A]) to become approximately thermal it is necessary that 
the number of states of the bath grows exponentially with P. What happens in this 
case? 

First, notice that the two terms in the right hand side of Eq. fll20p are non-negative 
and hence must both be small individually for the inequality to become non-trivial. For 
the interesting case H = Hq + Hj and H' = Hq this implies that it is necessary that 
||P/||oo -C e, so that the hrst term can become small. For the second term we restrict 
our attention to U£/(2Umax) as AU can reasonably be assumed to be smaller than U^. 

If to good approximation 


#i|ff„](B)«#i[ffl(£)«e-'=^ 


then 


3 111 . Appendix H] 




> 


1 — e 


2U, 


2(1 




(124) 


( 125 ) 
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That is, for Corollary [T] to be non-trivial it must be possible to chose an e such that 


(3 \\Hi\\^<^(3e < 1 . 


At the same time, if Eq. (I124p is fulhlled, then also 3 111. Appendix H] 

^ /5 e 


< 


2 a 


1 — 


Under the reasonable assumption that Ar2/(2r2max) "C 1 the choice e 
yields 

p(n[a](/),n[ao](/)) < 2-^/^^ 
which gives a non-trivial upper bound as long as 


l_e-/3A ’ 


l«/llo„«l//3«A. 


(126) 

(127) 

VWAJll 

(128) 

(129) 


Concluding, we can say that for reasonable bath Hamiltonians Hb, and if the coupling 
is weak enough and A large enough such that Eq. (I129j) is fulfilled, then one can expect 
that 

V (n»|ff|(lB,i5+ A]), jlffsisKO)) 6 o ((/? IlftlD*'"), (130) 

i.e., that the reduced state on subsystem S of the micro-canonical state is close to a 
Gibbs state of the restricted Hamiltonian truncated to S. Corollary [1] and the above 
discussion quant ify t he errors in the approximate equalities Eq. (7) in Ref. 68j and 
Eq. (18) in Ref. H. 

For the rest of this section we consider a bipartite quantum system with V = S' U R 
of spins of fermions with Hamiltonian H. Let Hq := Hs + Hb and Hj := H — Hq. We 
are now in a position to state the kinematic version of the thermalisation result, which 
follows from the above discussion of Corollary [1] and Theorem [9l 


Observation 4 (Most Haar random states are locally thermal 31l| ). Let R := 
[E, E + A] be an energy interval and Hr C H the subspace spanned by all eigenstates of 
H to energies in R with dimension dR := dim('RR). If the bath has a “generic” locally 
interacting Hamiltonian with the property that for energies in [E, E + A] the logarithm 
of the number of states Iii^^IHbib] can be well approximated by an affine function with 
slope (3 and if moreover A is sufficiently large and the interaction sufficiently weak such 
that 

ll^/IL«l//9« A, (131) 

and the interval R is sufficiently far from the edges of the spectrum, then for every e > 0 


P 

aar [Hr] 


V 




( 132 ) 


where C = 1/(36 vr^) and 6 {Hb) decreases fast with the size of the bath. 
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To state the dynamic result we introduce the notion of rectangular states 311 


We call a state p G S{'H) of a quantum system with Hilbert space "H and Hamiltonian 
H G 0{'H) rectangular with respect to an energy interval [E, + A] C M if de-phasing 

with respect to H yields the micro-canonical state corresponding to [E,E + A]. For 
example, if H has no degeneracies, then a state is a rectangular state if, when expressed 
in the eigenbasis of H, it has non-zero matrix elements only in some diagonal block and 
the same value for each entry on the diagonal in this block. The class of rectangular 
states is not a very large class of states, but generally comprises a lot of pure states 
and usually also states that are out of eguilibrium, in the sense that their reductions on 
a small subsystem are well distinguishable from a thermal state and at the same time 
have a sufficiently widespread energy distribution such that Theorem [1] can be used to 
guarantee equilibration on average. Nevertheless, all these states have a tendency to 
thermalise dynamically: 


Observation 5 (Thermalisation on average (31l| ). Let R := [E,E + A] be an energy 
interval. Let the bath have a “generic” locally interacting Hamiltonian with the property 
that in an energy interval [E^ £'-|-A] the logarithm of the number of states \n{^A[HB\B]) 
can be well approximated by an affine function with slope fd. If A is sufficiently large 
and the interaction sufficiently weak such that 


ll^/IL«l//9«^> (133) 

and the interval R is sufficiently far from the edges of the spectrum, then the time 
evolution is such that the subsystem S thermalises on average, in the sense of 
Definition for any initial state p(0) G iS('H) that is rectangular with respect to R 
in the sense that 

v(pS(t),a[Hs,smf < I (a'(£) 4 g{(Pk)U)) + HHb) + o ((/? ||//,IL)‘'"), 

(134) 

where S{Hb) decreases fast with the size of the bath, and, as in Theorem^ 

N{e) := sup|{(fc,/) G [df: k I A E^ - Ei e [E,E + e]}\ 

d' 

9{{Pk)i=i) ■= min(y'pfc,3max'pfc), 

' ^ k 

k=l 

with {pk)k=i Ihe energy populations, i.e., pk ■= Tr(nfcp(0)), and maxfpk the second 
largest element in {pk)^^i. 

The class of rectangular states seems fairly unnatural on first sight, however, 
condition of being rectangular can be slightly weakened. For small deviations from 
a rectangular state Observation [5] still essentially holds, just an additional error must 
be taken into account. If the deviation from rectangular is in a sense uncorrelated with 
the relevant properties of the energy eigenstates, then even relatively large deviations 
should be tolerable as the errors will not accumulate but rather cancel each other out. 


(135) 

(136) 






7 THERMALISATION 


65 


In the worst case, however, the deviation from rectangular could be highly correlated 
with the expectation value of, say, a local observable. Then, even small deviations from 
rectangular can lead to noticeable deviations of the equilibrium state from a th erma l 
state, that this can indeed happen in natural models for natural initial states 


254 


In this sense the condition of being rectangular is necessary for thermalisation if no 
conditions on the energy eigenstates are to be imposed. 

A comment on the notion of weak coupling used here is in order: The condition 
that is needed for the above results to be non-trivial is (compare Eq. (IldOj) ) 


/3 \\Hi\L « 1- (137) 

This is a signihcant improvement over the condition that would be necessary to 
guarantee that naive perturbation theory on the level of individual energy eigenstates 
is applicable (namely that is much smaller than the gaps of Hq). While the 

gaps of Hq become exponentially small with the system size (3 can be expected to be an 
intensive quantity, i.e., to be independent of the system size. It may be worth noting 
that conditions similar to this have been considered in very practical cont exts, say, when 
studying the thermalisation of two weakly coupled hnite metallic grains (32l|. 

In the case of a ID system with short range interactions and if S' is a set of 
consecutive sites ||77/||^ is also intensive. In this case, Eq. (I137p is a physically natural 
condition to call the coupling weak. In the analogous situation in higher dimensional 
lattices, for example a system with nearest neighbour interactions on a 2D square lattice 
and S the sites inside a ball around the origin, ||i7/||oo, however, scales with the surface 
of the region S, making the above bounds useless already for medium sized \S\. Thus, 
the above results are not entirely satisfactory. 

The reason for this is essentially that the trace distance is a very sensitive metric. 
If r\^[H]{[E, E + A]) and g[Hs\s]{P) the optimal (3 only differ slightly on each of 
the sites along the boundary of S, then their trace distance (at least as long as it is 
sufficiently far from one) will be approximately proportional to the surface of S. In 
consequence, the unfavourable scaling of the given error bounds is expected. 


7.4- Thermalisation in translation invariant systems and equivalence of ensembles 


Thermalisation and the related question of the equivalence of ensembles have recently 
also been investigated in the more concr ete setting of (translation invariant) locally 


interacting systems on cubic lattices 


309 


The additional structure can be used to go 


beyond the results discussed in Sections 17.31 In this s ectio n we discuss the main results 
of Ref. 


309| and the general isati o ns a chieved in Ref. 322|. 


More concretely. Refs. 3091. l322| consider systems with k-local Hamiltonians on 
cubic lattices. A Hamiltonian H is k-local if for some spatial dimension D G Z+ and 
linear size n G Z+ the vertex set of the system is V = [n]'^, the edge set £ contains 
only subsystems X of diameter at most k measured in the graph distance of the lattice. 


and the corresponding local terms H have norm bounded by one, i.e., \\Hx\ 


= 1 . 
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Furthermore, a Hamiltonians H is called translation invariant if for any two subsystems 
X, X' C V t hat differ only by a translation on the lattice it holds that Hx = Hx>- 

In Ref. 3n9| a family of translation invariant systems of increasing size is considered 
thermalising if they equilibrates on average to a state that, in the limit of infinite system 
size, becomes indistinguishable from thermal states of that stem. Th is is a very natural 
notion of thermalisation in the translation invariant setting. Ref. 309| contains theorems 
very reminiscent to both the kinematic thermalisation result (Observation 0]) and the 
dynamical result (with a similar conditions on the initial state) on thermalisatio n on 
average (Observation [5]) for this notions of thermalisation. The results of Ref. j309 


are applicable in situations with strong coupling between subsystem and bath, i.e., 
||i7/||^ > 1//3 but are only asymptotic statements and work only for tem peratures 
around which the translation invariant system has a “unique phase” (see Ref. |145l . 1309 


for more details) in the limit of infinite system size. To understand what a “unique 
phase” is note that in the limit of infinite system size a translation invariant state p is 
given by a series of subsystems states px which for all X C X' C V fulfil the consistency 
conditions px = Trx= Px'- If is then instructive to define a translation invariant state p 
of the infinite system to be thermal if it minimises the free energy density 


f{p) := lim 
II —^oo 


Tr(i7xPx) — S{px)/(3 

W ■ 


(138) 


Whenever |V| is finite, this definition is consistent with our definition of a thermal state 
from Eq. (|38l) and moreover the thermal state is unique. In infinite systems, however 
this is not the case any more and one hence says that a system has a “unique phase” 
around some inverse temperature ft if for all inverse temperatures close to ft the system 
has a unique thermal state in the above sense. At low temperatures this condition is 
often violated (for example in the 2D Ising model below the Curie temperature). In 
contrast, at high temperatures the existence of a unique phase is always ensured (see 
Section [T0|) . 

The kinematic and dynamics thermalisation results of Ref. j3n9l| rest on a 
eguivalence of ensembles theorem. Two ensembles, for example the canonical and 
micro-canonical ensemble, are said to be eguivalent here if their corresponding states 
become indistinguishable on small subsystems when the total system size is increased. 
More concretely: given a locally interacting spin system with Hamiltonian H G 0{'H) 
and an inverse temperature ft, under which conditions does there exist a suitable 
energy interval [E, E + A] such that for all sufficiently small subsystems S CV the 
distinguishability V{g^[H]{ft), n^[H]{[E, E-l-A])) is small. Unfortunately Ref. |3n9| does 
not give concrete finite size bounds on the distinguishability but only makes statements 
about the asymptotic behaviour. 

This however was recently achieved in Ref. j322l | . together with a generalisation 
to systems without translation invariance. Before we can explain this result in more 
detail we need to introduce the notion of z)-exponentially decaying correlations that 
is closely connected to the notion of a “unique state” encountered before. Remember the 
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definition of the covariance from Eq. fl35p . A state p of a system on a lattice is said to 
have z)-exponentially decaying correlations if for some constants ^,z ^ M"*" and any 
two observables A, B ^ 0(1-L) 


cov,(4-S) < ll-4|L||B|LAf'e-'‘'-<'=>y 


(139) 


where d is again the graph distance of the lattice. 

A simplihed version of the main result of Ref. j322l| can then be phrased as follows: 


Theorem 12 (Equivalence of ensembles 3221. Theorem 1]). Fix a spatial dimension 
D G , a locality parameter k G Z"*", a linear region size I G Z"*", an inverse temperature 
fd, and ^,z ^ M’*'. For n G Z"*" consider an infinite family of spin systems with vertex 
sets V = [n]^, Hilbert spaces Hy, and k-local Hamiltonians HyO{'H). If the family 
of thermal states g[Hy]{l3) has z)-exponentially decaying correlations, then for the 
family n[i7v]([-^v ~ ^v/2, Ey + Av/2]) of micro-canonical states with 


Ey.= TT{Hyg[Hym) 


(140) 


and 


Av := 


N 


Tr {Hlg[Hym) - Tr {Hy g[Hym)‘- 


1/2 


(141) 


it holds that 


\im V{n^^[Hy]{[Ey,Ey +Ay]), g^^[Hym) =0 (142) 

n^oo 

for any family Xy gV of subsystems whose diameter grows at most as fast as 

The main virtue of Ref. 322| is that it actually gives a concrete hnite size bound 


on the average distance between t he ca nonical and the micro-canonical state on hyper- 
cubic subsystems. In short, Ref. 322| shows that and how the canonical and micro- 
canonical states become indistinguishable on any sufficiently small subsystem when the 
total system size increases given that fd is such that the thermal state of the total system 
has exponentially decaying correlations. We see later in Section [10] that at sufficiently 
high temperatures the necessary correlation decay can always be ensured. 


7.5. Hybrid approaches and other notions of thermalisation 

We have seen in the last sections that both approaches to explain thermalisation, 
the eigenstate thermalisation hypothesis and thermalisation under assumptions on the 
initial state, have their advantages and drawbacks. They can be understood as extreme 
scenarios. In most cases where thermalisation of closed quantum systems happens it 
is probably due to a mixture of the two effects. An interpolation between the two 
previou sly d iscussed approaches is provided by the eigenstate randomisation hypothesis 
(ERH) 323|. The ERH is a weaker condition than the ETH. Instead of demanding that 
for certain observables the expectation values of all individual energy eigenstates with 
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nearby energies give approximately the same expectation value (compare Conjecture [T] 
and Definition [3]), the ERH requires only that the variance of certain coarse-grainings of 
the sequence of expectation values of an observable in the energy eigenstates becomes 
sufficiently small. This, together with a condition on the smoothness of the energy 
distribution of the initial state that is milder than what we required when we introduced 
the class of rectangular sta tes, i s sufficient to prove a thermalisation result that is similar 
in spirit to Observation [5] 323| . Agai n, n umerical evidence for the validity of the ERH 


in certain models has been collected |323| . 

It seems worth repeating that the notio n of thermalisation used here is surely not 


the only reasonable one. For example Ref. 132| works in the setting of macroscopic 


commuting observables of von Neumann, which we discussed briefly in Section [6l A 
system is declared to be in thermal equilibrium if there is a phase cell that is much 
larger than all others and the state of the system is almost completely contained in the 
subspace corresponding to this cell. 

Many other definitions of thermalisation or thermal equilibrium in quantum many- 
body systems are possible. For example, in the context of the ETH it is sometimes said 
that a system is thermal if the expectation values of a given observable in the energy 
eigenstates of a system are, up to small fluctuations, smooth functions of the energy 
(compare for example Ref. |45|). The validity of fluct uation-di ssipation theorems has 
also been considered as a condition for therm alisa tion 324, l325l | . 

The notion of relative thermalisation 326| focuses on yet another aspect of 
thermalisation. Rather than being concerned with the closeness of an equilibrium state 
to a thermal state of some kind it stresses that a system can be considered truly thermal 
only if it is not correlated with any other relevant system, as otherwise phenomena such 
as anon ialou s heat flow, which go against the predictions of thermodynamic, can occur. 
In Ref. 326| a subsystem S is called approximately thermal relative to a reference system 
R, if the joint state psn is close in trace norm to a state of the form tts (8) Pr with tts 
being a suitable micro-canonical state. Decoupling techniques can be used to show that 
whenever certain entropic inequalities are fulfilled then most joi nt ev olutions of S, R, 


and an environment lead to approximate relative thermalisation 


326 


7.6. Investigations of thermalisation in concrete models 

A large body of literature is concerned with investigations of thermalisation in specific 
quantum many-body models. Many of those studies are directly concerned with testing 
a variant of the eigenstate thermalisation hypothesis (ETH) at the level of individual 
eigenstates. The various eigenstate thermalisation hypotheses differ in whether they 
conjecture closeness to a micro-canonical or a canonical average and concerning the type 
of observables they supposedly apply to. Few body and (approximately) local observables 
are the two most frequently encountered choices. 

The ETH gained wide popularity after the series of influential works js^, 0, l327| . 
They identify the ETH as the mechanism for thermalisation and study its breakdown 
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close to integrability in systems of hardc ore b osons by means of exact diagonalisation. 
Similar conclusions are reached in Ref. 18l| for fermionic systems and the ETH is 




compared with other signatures of quantum chaos. Ref. j45| represents a sound and 
detailed study of the validity of the ETH in systems with a tunable integrability breaking 
term by me ans o f hnite size scaling and varying the strength of the integrability breaking 
term. Ref. 300| discusses the validity of the ETH in a simple model i naki ng use of a 


numerical technique that does not rely on exact diagonalisation. Ref. 328j presents a 
detailed study of the fluctuations of diagonal and off-diagonal matrix elements in the 
energy eigenbasis of certain physical observables in Heisenbe rg sp in chains that conhrms 
that in the non-integrable case the ETH is fulhlled. Ref. Oi hnds a breakdown of 
thermalisation and the ETH in a non-integrable model of spin-less fermions with a 
powe r law like random hopping term if the decay exponent is sufficiently large. In 
Ref. j329| the ETH is connected with von Neumann’s quantum ergodic theorem and it 
is confirmed that after a quench from a model that fulfils the ETH to one that does not 
(for the system being integrable), a system can still behave thermal. Ref. performs 
a finite size scaling analysis of the validity of the ETH in the (integrable) Lieb-Liniger 
model and demonstrates that a weaker version of the ETH still holds that is sufficient 
to guarantee apparent thermalisation for initial states that occupy sufficiently many 
energy eigenstates. 

On top of that, a large body of literature exists that investigates all sorts of 
aspects of thermalisation and how various properties of t he Hamiltonian a nd initial 
state influence it — in fact, this has a long history 49, 330|: Refs. H 0, 0 H, H 
numerically and experimentally study transport and thermalisation in the (non- 
integrable) Bose-Hubbard model. Ref. j0| focuses on the (fermionic) Hubbard model 
at small interaction strength. Using flow techniques the temporal evolution investigated 
and is found to go trough three distinct regimes. After an initial build-up of correlations 
the system exhibits an intermediate, non-equilibrium, pre-thermalised, quasi-steady 
state and then eventually becomes indistinguishable from being thermalised. Similar 
pre-thermalisation effects - building upon the theoretical understanding discussed in 


Ref. 


331 


have been observed in Ref. 


332| in systems of spin-less fermions, and in Ref. 


204| in Bose- an d Fe rmi-Hubbard models, 

in instances of non- 


333 


in Ref. 

integrable quantum spin chains. Similar pre-thermalisation effe cts were also fou nd in 


336 


systems evolving under stochastically changing Hamiltonians 334 l335| . Ref. 
studi es th e energy relaxation and thermalisation of hot electrons in quantum wires. 
Ref. 310| investigates the influence of the initial state on t he time scales on which 


thermalisation happens in a non-integrable model. Ref. 174| looks at local and non¬ 
local conservation laws and how they influence the non-equilibrium dynamics and 
thermalisation. The equilibration and thermalisation after a quench to a coupled 
Hamiltonian of two identic al un coupled systems initially in thermal states at different 
temperatures is studi ed in 213| and thermalisation to a state close to a joint thermal 
state is found. Ref. j337l| investigates conditions for equilibration and thermalisation 
(albeit in the sense of convergence in the limit t —)■ oo) in the well studied model of 
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a central harmonic oscillator linearly coupled to an infinite number of other oscillators 
starting from a non-thermal product initial state. 

The bottom line of this large amount of investigations is as follows: The energy 
eigenstates in the bulk of the spectrum, i.e., those to energies that are neither too low nor 
too high, of sufficiently large and sufficiently complicated composite quantum systems 
seem to generically fulfil some variant of the ETH for certain physically meaningful 
local or few body observables. Equilibration of local and few body observables is a very 
common phenomenon shared by almost all reasonable locally interacting many-body 
models for wide classes of initial states. This in turn implies that those systems which 
fulfil a suitable variant of the ETH also almost always dynamically thermalise after 
being started in a non-equilibrium initial state, like for example after a quench. 

Many studies moreover conclude that the fulfi l men t of the ETH i s related to non- 
integrability or chaos 44, 45, 45, 84, 118 . 178 . 181 . 812 . 818 . 329, 888 1. Moreover, it is 
often suggested that systems fu lfil the ETH and thermalise if and only if they are non- 


integrable [33|, 13^, 155|, I889L 134Q|, disordered systems being an important exception |254 
(see also Section |83])- What precisely the term non-integrable means in the context of 


many-body quantum mechanics and especially in systems without a well-defined classical 
limit and the relation between fnon-H ntea r abili t y an d (exact) solvability are, however, 
still the subject of a lively debate 
in Section [9l 


48l . l314j. I 3411 . 1342]. We will come back to this issue 


8. Absence of thermalisation and many-body localisation 

In the past section we have identified and discussed conditions under which locally 
interacting many-body systems exhibit thermodynamic behaviour like equilibration and 
thermalisation. Complementing these considerations, in this section we will identify and 
discuss scenarios in which thermalisation is prevented. In particular we will be concerned 
with situations in which a system fails to thermalise locally because small subsystems 
retain memory of their initial conditions. Quite intuitively the presence or absence 
of thermalisation is intimately linked to the transport properties of a system. After 
all, for thermalisation to happen stating from a non-equilibrium initial condition, some 
equalisation of initial imbalances in, for example, the spatial distribution of energy or 
particles must happen. We will see that the concept of entanglement, to what extend 
it is present in the eigenstates of a Hamiltonian and how it spreads through the system 
during time evolution, will be of great use to gain insights into such transport processes. 

We start by formulating what we mean by absence of thermalisation and in 
particular define violation of subsystem initial state independence. The main part of this 
section will be dedicated to the discussion of physical situations in which one naturally 
expects such an absence of thermalisation to happen: systems with static disorder in the 
Hamiltonian. This will lead us to the intriguing phenomenon of (many-body) localisation, 
a type of localisation in which disorder and interactions interplay in a subtle fashion. In 
fact, one of the currently discussed definitions for many-body localisation in quantum 
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systems takes the absence of thermalisation as its defining feature 343l - l345| . We make an 
attempt to survey the newly emerging debate concerning this phenomenon. After a brief 
introduction to Anderson localisation we discuss properties that can be expected from 
a many-body localised phase and collect different notions of many-body localisation. 


8.1. Violation of subsystem initial state independence 

We start with dehning subsystem initial state independence. Roughly speaking, a system 
fulhls subsystem initial state independence for a certain set of initial states if changing 
only the subsystem part of an initial state from that set does not noticeably influence 
the equilibrium state of the subsystem. This can be put as follows: 

Definition 4 (Subsystem initial state independence). We say that a composite system 
with Hilbert space Ti and Hamiltonian H G 0{'H) satisfies subsystem initial state 
independence for subsystem S on average with respect to a given set of initial 
states So C iS('H) if for all p(0) G So the equilibrium state on S is sufficiently 
independent of its initial state in the sense that for every quantum channel C G 
T^ifH) with support supp(C) C S the states p{0) and C{p{0)) have the property that 
'D(Tr 5 c[$j:/(p( 0 ))], Tr 5 c[$j:^(C'(p( 0 )))]) is sufficiently small. 

If a system does not exhibit any local exactly conserved quantities, subsystem 
initial state independence, as dehned in Dehnition 01 with respect to a sufficiently 
large set of initial states So C S, can rightfully be considered a necessary condition 
for thermalisation of small subsystems, regardless of which precise dehnition of 
thermalisation is adopted. 

As was shown in Ref. 


254| subsystem initial state independence after a quench can 


be provably violated if the Hamiltonian exhibits a lack of entanglement in the eigenbasis. 
The central quantity in the argument is the effective entanglement in the eigenbasis. 
Given a bipartite spin system with V = S U B, Hilbert space Ti, and Hamiltonian 
H G 0{TL) with spectral decomposition H = Rfc we dehne for any pure state 

G S the effective entanglement in the eigenbasis as 


d' 




(143) 


fc=l 


with pk := Tr(nfc -0) the energy level populations. If the Hamiltonian is non-degenerate 
it takes the simpler form 




(144) 


k=l 


The name effective entanglement in the eigenbasis is justihed by a result of Ref. 254| . 
which bounds Rsm by a quantity that is closely related to the geometric measure of 


entanglement |346l - l348| . If the eigenstates of H are little entangled , and fj is a suitably 
chosen product state, then Rs\B{f>) is small. In fact one can show (12011254| that there 



















8 ABSENCE OE THERMALISATION AND MANY-BODY LOCALISATION 72 


exist many initial states that are perfectly distinguishable on the subsystem but that 
have both the properties needed to ensure equilibration on average of small subsystems 
according to Theorem [1] and a small Rs\b if H is non-degenerate and its eigenbasis is 
only little entangled. The type of system that are naturally expected to show such a 
behaviour, as will be discussed in the subsequent subsection, are many-body localising 
systems. The effective entanglement in the eigenbasis can be used to bound how much 
closer the reduced states on S of two different initial states can move during equilibration 
on average in the following sense: 


Theorem 13 (Distinguishability of de-phased states 254 Theorem 1]). Consider a 
bipartite spin system with V = S (J B, Hilbert space Ti and Hamiltonian H G 0{'H). 
Eor j G {1,2} let V’j(O) = V’f(O) ® V’/(0) ^ initial product states and set 

^s{j) TrB($H(V'i(0))) ihen 


V ^ (V^f (0), V-f (0)) - i?s|B(^i(0)) - i?s|B(V'2(0)). (145) 


If the state of the subsystem S equilibrates on average during the evolution under 
H for the two initial states, then the de-phased states = TrB($ji^('0j(O))) are 

the respective equilibrium states. The theorem shows that if i?('0i(O)) and i?('^2(0)) 
are both small, then the subsystem equilibrium states and cannot be much 
less distinguishable than the initial states 'i/'f(0) '0f(O). We summarise this in the 

following observation: 

Observation 6 (Absence of initial state independence). Consider a bipartite spin 
system with V = S U B and Hilbert space H. Let Hr ^ H be a subspace of dimension 
dR := dim('H/j). If dR is large and the Hamiltonian H G 0{H) has not too many 
degenerate energy gaps (see TheoremUl for details) and an orthonormal basis (|j))j£i 
for Hs exists for which 5 := max^gj^^j^fc, with 


6k := min V (Tr^ \Ek){EK\,\j){j\), (146) 


is small, then for every j,j' G [ds] there exist many initial states of the bath '0'®(O) G 
S{Hb) such that according to Theorem\^ both \j){j\ ® t/'^(0) ®'0'®(O) lead to 

subsystem equilibration on average, but despite them having exactly the same initial state 
on the bath, the corresponding subsystem equilibrium states and remain well 

distinguishable for most times during the evolution, in the sense that their trace distance 
is significantly larger than zero whenever j ^ j', because of Theorem, [TR 


A statem ent complementing Observation [6] can be found in Ref. 1071 . Section B] 


(see also Ref. 125| for a generalisation to mixed initial states and situations with initial 
correlations to reference system). There, it is shown that if the energy eigenstates of a 
non-degenerate Hamiltonian does contain a lot of entanglement, then subsystem initial 
state independence can be guaranteed. 

In a very simila r spir it as above, absence of initial state independence has also been 
studied later in Ref. 349| . which gives a condition that is necessary for subsystem initial 
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state independence. The article mostly studies a simplified version of this condition, 
which essentially demands that the reductions of most eigenvectors of the Hamiltonian 
must be sufficiently close to the maximally mixed state. 


129 


130 


More recently, initi al state i ndependence was studied in Ref. 1251 Il90| h By using 
the decQu ylinQ meth od 126l - ll28| and the formalism of so-called smooth min and max 
entropies 


The authors show that it can be decided from just looking at 
one particular initial state whether a system satisfies initial state independence for 
most initial states. Moreover, they give sufficient and necessary entropic conditions for 
initial state independence of most initial states. The authors consider both subsystem 
initial state independence and bath initial state independence, i.e., the independence 
of the equilibrium state of the subsystem from the initial state of the bath. Th e 
results concerning the absence of sub system initial state independence of Ref. 190|, 


when compared to those of Ref. 254| discussed above, have the advantage that they 
apply to specific points in time instead of time averaged states and that the subsystem 
does not need to be small. On the other hand they only hold for most/typical initial 
states. 

that numerically 


There exist several articles, including Refs . |52l . l271l . I339L 1340 


and analytically study related effects. Ref. 339| finds that the existence of few energy 
eigenstates that violate the eigenstate thermalisation hypothesis (see also S ectio n 17.21 
and in particular Definition [3]) can lead to absence of thermalisation. Ref. j34n| goes 
beyond the closed system setting and considers thermalisation and its absence in systems 
that are coupled to thermal baths and finds that certain integrable models do not 
thermalise. Ref. 52| studies quenches in a homogeneous XY quantum spin chain with 
transverse field starting in ground, excited, and thermal states. The authors find that 
after certain que nche s local observables fail to thermalise and relate this behaviour to 


criticality. Ref. 271| investigates equilibration and thermalisation in exactly solvable 


models and finds that in such models correlation functions can retain memory of the 
initial conditions. 


8.2. Anderson localisation 

With the aim to develop a better understanding of parti cle and spin transport in 
materials with impurities, Anderson in his 1958 article 350| proposed a simple model 
for quantum mechanical particles in a lattice with a random potential and showed 
that the randomness can lead to a complete suppression of diffusion or transport. 
This phenomenon became known as Anderson localisation. More concretely, the model 
studied by Anderson is a tight-binding model on a cubic lattice of dimension D with a 
single particle hopping on the lattice sites. The Hilbert space is 77 = spanned by 

vectors |x) interpreted as the state with the particle at position x G The random 
Hamiltonian of the Anderson model reads 

H{V)= Y. \x){y\ + xY^-\^)(^l 


: \x—y\=l 




(147) 
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where A > 0 and V a family of random nnmbers 14 drawn i.i.d. from a snitable 

distribntion m _ ^The first term describes hopping between nearest neighbonrs in the 

lattice (Ref. (35n| actnally also considers more general longer range hopping), while the 
second represents a random on-site potential. For re views on the Anderson model from 

For a general ove rview 


351 


aSi- 


the perspective of mathematical physics, see Refs, 
written on the occasion of the 50th anniversary of the phenomenon see R ef. 354| . For 
a good book also providing significant historical context, see Ref. 


352 


To simplify 

the discnssion we concentrate on one spatial dimension D = 1, and assnme that the 
distribntion p is absolntely continnons with a bonnded density of compact snpport. 
Because of the existence of the rigorous mathematical literature, we moreover take the 
liberty to brush over some subtleties. 

The Anderson model exhibits “localisation”. This is true in at least two different 


senses of the term |351| : First, the random Hamiltonian (I147p almost surely exhibits 
spectral localisation, meaning that it has a pure point spectrum (that densely fills all 
non-trivial open intervals contained in its almost sure spectrum) and that the associated 
eigenfunctions are exponentially decaying. The latter means that HiV) has a complete 
countable set of eigenvectors {\Ek)}k obeying 


3C > 0,? > 0: VIBt) 3i„: \/x: |(x|£:t)| < 


(148) 


Here ^ > 0 is called the localisation length scale. That is to say: Almost all Hamiltonian 
eigenvectors are exponentially clustering. For systems in in more than one dimension 
a similar statement holds either at sufficiently high disorder, or for energies sufficiently 
close to band edges. Note also that these results can be extended to finite systems, 
localisation then holding with high probability instead of almost surely. 

Second, the model exhibits almost surely dynamical localisation. This can be 
captured as follows: The random Hamiltonian HiV) is said to exhibits dynamical 
localisation in an open interval I if 

3C >0,^>0:Yx,y eZ^: E (sup(x| (149) 

tes. 

where H/ is the spectral projector corresponding to the interval I. Dynamical 
localisation implies a complete absence of transport. In particular it implies that all 
moments of the “distance from the origin” operator |X|, which acts like |X| |a;) = |x| |x), 
are bounded uniformly in time, i.e., that 

\/p > 0, x,y e Z^: sup |4)|| < cxD (150) 

for 14) any state vector with compact support almost surely. Despite the hopping term 
in the Hamiltonian, which in the absence of the disordered potential allows the particle 
to move through the lattice, in the Anderson model the particle “gets stuck”. The 
probability of finding it on a site different from its starting point decays exponentially 
with the distance uniformly for all times. Dynamical localisation implies spectral 
localisation by the RAGE theorem, but the converse is not necessarily true. 
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The above discussion immediately carries over to, for instance, disordered quadratic 
fermionic systems in which the quasi-particles do not interact. In one spatial dimension, 
the corresponding Hamiltonian reads 


= Y + CjI) + 


(161) 






As the fermions do not interact in such quadratic models, each of them behaves as in 
Anderson’s model and conductivity is completely lost. The Hamiltonian can also be 
readily related to local spin models by virtue of the Jordan Wigner transformation and 
Anderson’s conclus ion c an be argued to still holds for interacting particles when the 
density is very low j35fll| . 


8.3. Many-body localisation 

An intriguing, and in large parts still unsettled, question is whether and in what precise 
sense localisation survives in systems with intera ctions and signihcant particle densities. 
This issue has already been raised by Anderson 350L l355| . One expects that in models 
with sufficiently strong disorder some characteristics of Anderson localisation should 
survive in the presence of interactions. This new phase of matter is commonly referred 
to as the many-body localised (MBL) phase. In which sense and under which conditions 
this is in fact true is the subject of ongoing investigations. In fact, there is no complete 
consensus yet as to what precisely constitutes many-body localisation in the hr st p lace. 
In the following we collect and compare different points of view (see also Ref. 356| ): 


Suppression of transport and localisation in Eock space: The influential Ref. 
gives signihcant evidence that indeed, localisation in the dynamical sense |17n| is 
maintained in the presence of interactions, by invoking a combinatoric argument 
and perturbation theory; For sufficiently high disorder and sufficiently low 
temperature (and absence of a coupling to an external heat bath) it is demonstrated 
that the conductivity in a disordered fermionic lattice system is exactly zero. The 
argu ment makes use of the concept of localisation in Eock space introduced in 
Ref. 


358 (see also Ref. 


3591): Consider a fermionic system whose Hamiltonian is 
a sum of a quadratic Hamiltonian Hq and an interaction term Hi. A many-body 
state is called localised if it is a superposition of only few of the (quasi-particle) 
eigenstates of Hq. If the relevant eigenstates of Hq + Hi are localised, i.e., all below 
a certain critical energy (called a mobility edge), in this sense then below a critical 
temperature the system exhibits zero conductivity. 

Absence of thermalisation: Closely related to the characteristic suppression of 
transport in localised systems is the absence of thermalisation due to a violation 
of initial state independence (see also Section [8T|) . This is a natural expectation, 
since one does not expect the ei gens tate thermalisation hypothesis (ETH) to be 
valid within the MBL phase. Ref. 343| for example studies a disordered Heisenberg 
chain and hnds a violation of the ETH and interprets this as one of the dehning 
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111 


features of MBL. As was shown in Ref. 254| violation of initial state independence 
in disordered systems can be understood as a consequence of a lack of entanglement 
in the eigenbasis. 

Clustering of correlations'. Another definition puts the clustering of correlations 
of eigenvectors into the centre of attention. For quadr atic models, it is expected 
that all Hamiltonian eigenvectors satisfy an area law 226| for the entanglement 
entropy. This means that in one dimension the von Neumann entropy of the 
reduced state of any energy eigenstate on any subsystem is upper bounded by 
a constant independent of the size of the subsyst em. A similar feature has also 
been suggested as a possible definition for MBL 360| : One then calls a system 
many-body localising if, not necessarily all but at least many (in a suitable sense), 
eigenstates satisfy an area law. A proof of a uniform area law (in expectation) was 
recently giv en fo r the case of the XY chain with disordered transverse magnetic 
field in Ref. j36l| . Numerically, there is strong evidence that this is indeed the case 
in disordered interacting models, at least below a mobility edge 


35 


so an energy 

scale that separates the MBL fr om t he “ergodic” regime. A connection with the 
dynamical aspect of localisation 170| was recently established in Ref. 


171 , where 


IV 


it was shown that invoking different readings of dynamical localisation, it follows 
that either all or many energy eigenvectors follow an area law. 

Logarithmic growth of entanglement: A yet different feature of MBL that has 
been suggested as a defining property is the logarithmic growth of entanglement 
in time. While the entanglement of generic local Hamiltonian models is expected 
to grow linearly in time (see also Section [375]) . quadratic models show a saturation 
of entanglement entropies. This is provably so, as a consequence of the complete 
suppression of transport. In interacting disordered models a slow — logarithmic in 
time — b ut unbounded growth of entanglement has been numerically observed 
362, l363| . This feature is perfectly compatible with individual eigenstates 


exhibiting little entanglement. In fact, maybe counter-intuitively, an unbounded 
growth of entanglement already follo ws from localised Hamiltonian eigenstates 
together with a generic spectrum js^. 

Approximately local constants of motion: Another discussed possibly defining 
feature of MBL is the prese nce of an extensive number of exactly or approximately 
local constants of motion j365l | . with the feature that the Hamiltonian c an be 


expressed entirely as a sum of polynomials in these quantities 1691.1364 1366 


If indeed such local constants {Aj} of motion can be found, {g, iL)-local in the above 
sense for a suitable function g, violation of subsystem initial state independence and 
absence of thermalisation follow immediately: Since 


tr(Ajp(t)) = tr(/l,f)(0)) 


(162) 


is true for all times t, the system can possibly only equilibrate to a state that has 
the same values for these conserved quantities (see Section 15.2p . 
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Other, quite sophisticated features also follow from the presence of such 
approximately local constants of motion. For example, one can derive a Lieb- 
Robi nson type bound with a causal “cone” that grows only logarithmically in 
time 


169| (see also Section 13.Sp . From such a bound one can deriv e th at the 
entanglement entropy can grow at most log arith mic in time 0, H II3|- An 


similar bound has been also obtained in Ref. 
and Ref. 


168| for a disordered XY spin chain 


170| improves upon this by giving a zero velocity Lieb-Robinson bound in 


disorder average for this model. 

A disadvantage of that dehnition is that it is far from clear how to construct or 
identify such approxim ately local constants of motio n in the hrst place. In the 
disordered Ising model 367| and the XXZ spin chain 365| this is indeed possible. 


but no general strategy has yet been found 366|. Several of these dehning features 


have also be en co nnected and made plausible using real space renormalisation group 
approaches 


368 


(vi) Poissonian level statistics: Properties of the energy level statistics of Hamiltonians 
have proved to be useful indicators for quantum chaos and integrability. It is 
hence natural to investigate the influence of disorder on the level statistics. A 
key quantity in this context is the distribution of gaps between consecutive energy 
eigenvalues. For quadratic models, this distribution typically is a Poissonian one. 
For interacting models, it is generally expected to follow a Wigner-Dyson type 


ai, H, H-Enl. 


distribution 

For typical many-body localised models, there i s strong numerical evidence that 
the distribution is again close to Poissonian 35, 343 . 344 . 358, l369l |. This can be 
quantihed by the ratio of consecutive level spacings 


mm 


C = 




max((5j, 5j+i) ’ 


(163) 


with Sj = Ej — Ej_i being the gap between consecutive energy levels. In the 
non-localised phase one can expect from Wigner’s surmise leading to the Gaussian 
orthogonal (GOE) or unitary (GUE) ensemble that a disorder average of Vj yields a 
value close to rcoE ~ 0.5307 or rcuE ~ 0.5996, while for a Poisson distribution, that 
one expect in the MBL phase, one obtains on average rpoisson = 2 log 2 — 1 0.3863. 

An extensive numerical analysis of this ratio of consecutive level spacings has 
been performed in Ref. 35| for the random held Heisenberg model on a ring. 
Also Ends excellent agreement of the position of the cross over in the consecutive 
level spacings statistics with that of an area law / volume law crossover of the 
entanglement entropy and a crossover in the scaling of the participation entropies (a 
quantity closely related to the inverse participation ratio and the effective dimension 
discussed in Section IT^ . This work also calculates these quantities in an energy 
resolved fashion and Ends that it can happen that for low energies a system shows 
strong signatures of a Poissonian distribution, while for higher energies, it resembles 
a Gaussian orthogonal ensemble consistent with the existence of a mobility edge in 


























































9 INTEGRABILITY 


78 


interacting systems. 

(vii) Power-law approach to equilibrium: Ref. j372l| identified a power law (as opposed to 
exponential) approach to equilibrium of local observables as a characteristic feature 
of the MBL phase. In addition the MBL phase has also been found to exhibit a 
slow power law like decay of the disorder aver age o f the survival probability, i.e., 
the hdelity with the initial state, at long times j873l| (see also Section [R7|) . 

Each of the dehnitions above only capture part of the intricate phenomenon of 
many-body localisation. In particular it is far from clear whether disorder is really 
necessary to realise all of the above qualifying features of many-body localisation. 
In fact, drawing intuition from classical glassy systems it is possible to design clean 
spin systems that show many of the features one would expect from a system with 
a MBL transition 374| . In fact many of the static properties discussed above may 
also occur in certain (nearly) int egra ble models without any disorder. Concerning the 
dynamical features of MBL, Ref. 375j for example demonstrates that the slow growth of 
entanglement entropies (Eyl) can also exist in clean syster ns. The same holds for long lived 
metastable states that break a symmetry of the system 173l.l374j - l376| (see also the effect 
of pre-thermalisation discussed in Section 17^ . It would be specihcally intriguing to see 
rigorously whether fully translation invariant models can exhibit dynamical localisation 
in the sense of property ([n]) even for inhnite time, similarly as this is possible for 


254 


interacting disordered models 

In several physical architectures, A nder son and many-body localisation has already 
been experimentally observed. 


Ref. 


3771] discusses an experimental observation of 


exponential localisation of a Bose-Einstein c onde nsate in a random potential generated 
with a laser speckle pattern. The recent Ref. 3781] experimentally probes the many-body 
localisation transition in a system of ultra cold fermions in a disordered optical lattice 
by measuring the imbalance between the occupation of even and odd sites starting from 
a situation where only even sites are occupied, resembling the experimental situation of 


Ref. 2g. For sufficiently strong disorder the imbalance is found to no longer decay to 


zero even for long times, reflecting the absence of thermalisation an d the violation of a 
subsystem initial state independence very much in the spirit of Ref. 


2541. 


9. Integrability 

In this section we discuss a concept that has recently started playing an important 
role in the debate on equilibration and thermalisation in closed quantum systems 
— the concept of integrability. It is often suggested or claimed that non-integrable 
systems thermalise, while integrable ones do not. This wisdom has become folklore 
knowl edg e that is often in voked in discussions and talks o n the topic (compare also 
Refs. 0 0 H S S H, Ei, EtsI, EmI E3, H, H-H ). in the following, we will 


briefly review the current state of affairs concerning the usage of the term (quantum) 
integrability in the context of equilibration and thermalisation in closed quantum 
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systems, comment on the concept of integrability and investigate to which extend 
the circumstantial evidence concerning the connection between (non-)integrabihty and 
thermalisation can be substantiated. 

To that end we will hrst recapitulate the dehnition of integrability in classical 
mechanics and then discuss obstacles for a generalisation of the concept of integrability 
to th e quantuni se tting. This assessment is largely based on the previous works 
Refs. 254 l3l4 l379| . We hnish with some speculations on the connection of quantum 


(non-)integrability and computational complexity. 


9.1. In classical mechanics 


In classical mechanics |38n| (Liouville) integrability is a very well-dehned concept. 
Consider a classical system with n G TY degrees of freedom, each associated with a 
coordinate qk and a corresponding momentum p^. Then, in the Hamiltonian formalism, 
the 2 n canonical coordinates {qk)k=i {pk)k=i span the phase space 5^ of the system 


282 


We assume that the Hamiltonian function JY: ^ R, i.e., the energy 
functional, of the system is time in depen dent. It then governs the time evolution of 
the system via Hamilton’s equations 


380 


Vfc G [n]: 


Pk = 


dJT 

dqk 


Qk = 


dpk 


(154) 


The dot indicates the derivative with respect to time of the corresponding quantity, i.e., 
qk is the temporal change of qk. Integrating these differential equations yields the phase 
flow : 5^ —)■ e5^, which maps the initial phase space vector of a system at time 0 to 
that at time t G R. Dehne for any two functions F, G: R their Poisson bracket 

(F, G) : ^ ^ R as 

(155) 


(F, G) := lim — F o g-^ 

where o de note s function composition. It turns out that (•, ■) is bilinear and skew- 
symmetric j38fl| . A function F : R is called a first integral of motion under the 

evolution induced by if {F,JY) = 0. More generally, if for F, G: ^ R it holds 
that (F, G) = 0, then F and G are said to be in involution. We can now dehne Liouville 
integrability: 


Definition 5 (Liouville integrability 380|). A classical system withn degrees of freedom 
is called (Liouville) integrable if it entails a sequence {Fk)k^i of n independent first 
integrals of motion that are pairwise in involution. 

Liouville’s theorem for integrable systems shows that Liouville integrable systems 
can be solved, i.e., the time evolution can be explicitly calculated, in a systematic way 
by quadratures, i.e., by direct integration of differential equations: 


Theorem 14 (Corollary of Liouville’s theorem for integrable systems 380| ). If a system 
is Liouville integrable, its time evolution can be solved by quadratures. 
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In more detail; Liouville’s theorem for integrable systems essentially ensures that, 
given the initial values of all canonical coordinates, the time evolution of an integrable 
system is confined to a smooth submanifold of the phase space that is diffeomorphic to 
an n-dimensional torus. The time evolution is quasi-periodic and can be described in 
terms of the so-called action angle coordinates {(pk)'k=i that parametrise the torus. 

The action angle coordinates can be explicitly constructed from the sequence 
{Fk)'k=i of ^ independent first integrals of motion and the values fixed for them. Fixing 
different values for the n first integrals of motion results in different tori. In the 
coordinate system of the action angle variables the equations of motion are given by 
2 n simple ordinary differential equations of the form = 0 and (pk = Wk, with Wk 
being constants that depend on the values that were fixed for the n first integrals of 
motion. 

If a Liouville integrable system is perturbed, then the time evolution is generally 
not confined to a torus anymore and cannot be derived in a systematic way. For small 
perturbations the Kolmogorov-Arnold-Moser (KAM) theorem ensures, under a so-called 
non-resonance condition^ that most t ori are o nly deformed and the time evolution on 
them is then still quasi-periodic 1851 . l38ll . l382| . 

In summary we have: Integrability in classical systems implies systematic solvability 
and thereby yields a qualitative classification of classical systems. Liouville integrable 
systems are not ergodic (see Section lA.ll) in the sense that their phase space trajectory 
does not explore the whole phase space, but is confined to a portion of it. Whether or 
not this implies that integrable systems cannot thermalise depends on the definition of 
thermalisation, but the motion of the system is quasi-periodic and hence no convergence 
of the state of the system in the limit t —)■ cxd is possible. Non-integrability in classical 
systems is not sufficient for ergodicity or chaos and hence also not sufficient for notions 
of mixing or thermalisation based on these concepts. Still, the concept of Liouville 
integrability yields a classification of systems with strong implications for their physical 
behaviour. 


9.2. In quantum mechanics 

Ideally, a notion of quantum integrability should yield a classification that divides 
quantum systems into two classes, integrable ones and non-integrable ones, with 
markedly different physical properties. In addition it should, in some sense, be 
a generalisation of Liouville integrability. However, if one tries to generalize the 
concept of Liouville integrability to quantum syste ms i n a straight forward manner, 
one immediately encounters problems (see also Ref. 34l| j: 

Consider a quantum system with d dimensional Hilbert space T-i and Hamiltonian 
H G 0{'H). An orthonormal eigenbasis {\Ek))f.^i of H, with corresponding 
eigenvalues {Ek)k=iJ can always be constructed in a systematic way by diagonalising 
the Hamiltonian. The time evolution of an arbitrary initial state vector \fi) G "H is then 
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given by 


k=l 




(156) 


with (pk{t) := arg((i?fc|'^)) — Ekt, where arg is the argument function, i.e., for every 
c G C, c = |c| The overlaps {Ek\'4’) can also be calculated systematically, so the 

time evolution of a (finite dimensional) quantum system can always be obtained in a 
systematic way for any Hamiltonian and any initial state. 

The analogy to the situation of Liouville integrable systems is striking: The 
dimension d plays the role of the number n of degrees of freedom of the system in 
the classical case. The linear functionals \{Ek\ ■ |: "H —)■ M, induced by the eigenvectors 
of H, are analogous to the first integrals of motion in Liouville’s theorem on integrable 
systems, and the time independent moduli of the overlaps \{Ek\'il^)\ = |(i?fc|'0(f))| play 
the role of the values fixed for these constants of motion. Finally, the functions (fk in 
the right hand side of Eq. (11561) satisfy differential equations analogous to those of the 
action angle variables, namely (fk = Ek, and the time evolution indeed happens on a 
d-torus. As in the classical case, the specific torus to which the evolution is confined 
depends on the values fixed for the conserved quantities. It seems that the dynamics of 
quantum systems is far less rich than that of classical systems. This constitutes a major 
obstacle for a good definition of quantum (non-)integrabihty. 

Before going on, it is reasonable to give a set of conditions that a good notion of 
non -)integrabilitv for quantum systems should satisfy. It seems reasonable to demand 


314| that a definition of quantum integrability should: 

(i) have implications for the physical behaviour, 

(ii) be applicable to a large class of quantum systems, 
be unambiguous, 
be decidable for concrete models. 


Ill 


IV 


Unfortunately almost none of the existing frequently used notions of quantum 
integrability seems to fulhl all these criteria. The following is a list of some of 
the definitions of quantum integrability that have been introduced, together with 
exem plary refe r ences in w hich the corresponding definition appears or is used (see also 
Refs. 


254 l3l4 I 379 L l383| ). A system is quantum integrable-. 


If it exhibits n physically meaningful m utual l y cq r nmu t ing conserved quant ities that 
are in some sense independent 42, 49, 144 . 279 . 342 . 384 1 (see also Ref. j379| and 
the r eferences therein) or depend linearly on some parameter of the Hamiltonian 

1^ . 

If it is integrable by the Bethe ansatz 


111 


If it exhibits nondiffractive scattering 


14 , 51.147113831 


3831 . 


(iv) If it has a classical limit that is integrable 


385 
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v) If its level statistics follows a Poi s son l aw and i s non-integrable if it is of Wigner- 


48, 

49, 

178, 

185, 

186, 

343, 

386 

^ _, 

CO 

00 

00 


VI 


vii 


vill 


If it does not exhibit level repulsion 389l. 1390 


If (many of) its e igenfunctions can be labeled in a certain way with quantum 
numbers 


3421. l39n|. 


If it is exactly solvable in any way jd^. 0, l342l. l39l| . 


In the hrst dehnition both physically meaningful and n can have very different meanings. 
It can, for example, in the case of composite systems, refer to local operators. The 
number n is usually taken to be equal to the number of degrees of freedom o f the model 
or the number of constituents in the case of composite systems. In Ref. 34 1| n can 
be any number between zero and d — 1, and models are then classihed according to 
this number n. Similarly, independent can have several meanings, linearly independent 
and algebraically independent being popular choices. Usually all quadratic systems and 
systems such as the Hydrogen atom fall in this category. Models that are integrable 
according to this dehnition are often also integrable according to one of the other 
dehnitions given above (especially Dehnitions (El]), (Iviil) . and (Iviiij) ). Many of the 
dehnitions of integrability of this type suffer from the severe problem that if the dehnition 
is taken seriously, all quantum systems classify as integr able and hence it violates 
Condition (Ej) (see the discussion above and Refs. 341 . l379| for a critic of such notions 
of integrability). 

Dehnitions ([n]), (En)) . and (EvT) are only applicable to restricted classes of models and 
hence violate Condition (El]) in the above list. The same holds, although argua bly in 


341 


a weaker sense, for Dehnition fl\a|) and the version of Dehnition (Ej) of from Ref. 
which are only applicable to systems which have a natural tuning parameter. 

Dehnitions ^ and fl^ suher from the problem that also certain models that 


Oyj) and (jyi 

are usually regarded as integrable can have spectra that would classify them as non- 


integrable 31 si . l390| . In fact, it is trivial to construct such examples. In a composite 


systems of, say, spin-1/2 systems, one can simply take a Hamiltonian that is diagonal in 
the usual Pauli-Z product basis and which hence should clearly be classihed as integrable 
and set its spectrum to be that of some non-integrable model. Moreover, natural tunable 
models are known that exhibit thermodynamic behaviour in both the regime that 
would be classihed as integrable and the one that would be classihed as non-integrable 
according to this dehnition j^. Hence, these dehnitions violate Condition fEnl) and (E]). 

Especially Dehnitions (E|) and flviiil) suher from the problem that it might simply be 
a lack of imagination that preyents one from hnding a relevant conserved quantity or 
from solving a given model and thus violate Condition (E^ . This is well illustrated by the 
recent (p artia l) solution of the Rabi model, which was long thought to be non-integrable 
(see Ref. 


342| and the references therein). 


In conclusion, it seems fair to say that the question of how to dehne integrability 
in quantum mechanics is still to some extent open and even more so for quantum non- 
integrability. At the same time a number of very useful and promising indicators of 
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and proposals for a definition of integrability exist (see also Refs. 3l4 l34l| for more 
background information and recent proposals). Still, general claims that “non-integrable 
quantum systems thermalise” seem unjustified at present. 


10. Decay of correlations and stability of thermal states 

In this section we will somewhat depart from the pure state quantum statistical 
mechanics approach, as we will take the canonical ensemble for thermal states for granted 
and turn to a study of structural properties of such thermal states. This will bring us 
to the seemingly innocent question; What is the meaning of temperature on very small 
scales and in which sense is temperature really intensive^ as is paradigmatically claimed 
in thermodynamics? The problem with assigning locally a temperature to a small 
subsystem of a global system in a thermal state is the following: Interactions between 
the subsystem and its environment generate correlations that can lead to noticeable 
deviations of the state of the subsystem from a thermal state. Given only a subsystem 
state, there is no canonical way to assign a temperature to the subsystem. We shall call 
this the locality of temperature problem. 

This proble m ha s been addressed in Refs 
studied in Ref. 


392 


395 


, and more recently extensively 
396| . There, three theorems are proven; A truncation formula, which 


allows to express the influence of sets of locally interacting Hamiltonian terms on the 
expectation value of an observable in the thermal state of a locally interacting quantum 
system in terms of a correlation measure. A clustering of correlations result, which 
shows that above a universal critical temperature this correlation measure exhibits 
an exponential decay. And hnally, a result that ensures local stability of thermal 
states above a universal critical temperature and thereby partially solves the locality of 
temperature problem. 

10.1. Locality of temperature 

These results build upon and si gnificantly go beyond previous resu lts on clustering of 


correlations in classical systems 39711398| . for quantum gases 


400 


399| . i.e.. translational 
4021. For the latter 


invariant Hamiltonians in the continuum, and cubic lattices 
systems the existence and uniqueness of thermal states in the thermodynamic limit at 
high temperatures is proven and analyticity of correlations can be derived. Moreover, 
in the regime of high temperatures, n-point correlation functions have bee n sh own to 
cluster for spin gases 39811399] and translational invariant bosonic lattices 


402 


To begin the more detailed discussion, we introduce a quantity that measures 
correlations. We dehne for any r G [0,1], any two operators A, R G B{'H), and any 
quantum state p G S{'H) the generalised covariance 

cov^p{A,B) := Tr(pMp^-"R) - Tr(pA)Tr(pR). (157) 

The choice r = 1 gives the usual covarianc^. As a side remark, the quantity covHA, B) 


For the fine print see [396 1. 
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also appears in studies of one dimensional models 403], where it can be written as a 
different times correlation function in terms of the transfer matrix of the system. 

The reason for introducing the general definition here is that it naturally appears 
in the truncation formula. Before we can state it we need one last piece of notation. 
For any subsystem X C V let Xg C S the set of edges that overlap with both X and 
its complement, i.e., 


Xa:={y Fnx^0Aynxv0}- 

We extend this notation to operators A G B{'H) and define 


(158) 


As - {Y 6 £: Fnsiipp(/1) # 0 A y n supp'(/l)}. 


(159) 


Theorem 15 (Truncation formula 3961. Corollary 1 and 4]). Consider a spin or 
fermionic system with Hilbert space Ti and let H G 0{'H) he a locally interacting 
Hamiltonian with edge set 8. Let B G 8 and define for s G [0,1] the interpolating 
Hamiltonian H{s) := H — [1 — s) YIxgBq Then, for any operator A G B{'H) with 
supp(a4) G B it holds that 





cov 


0 Jo 


9ms)m Hx) dr ds . (160) 

X&Bq 


The left hand side of Eq. (llOOh is the difference between the expectation value of A 
in the thermal states of the Hamiltonian Hb with only the terms contained in the region 
B and the full Hamiltonian H. The truncation formula quantifies how the expectation 
value of A changes when the terms that couple B to the rest of the system are added 
or removed, hence the name, and tells us that this change can be expressed exactly in 
terms of the generalised covariance. 

It is important to note that Eq. fll60p is an equality. The generalised covariance 
exactly captures the response of expectation values in the thermal state to truncations 
of the Hamiltonian. The truncation formula tells us that the response of the expectation 
value is small if and only if the right hand side of Eq. (llbOp . which is an average over 
the generalised covariance times /3, is small. In other words: 

Observation 7 (Locality of temperature |396| ). Temperature can be defined locally on 
a given length scale if and only if the averaged generalised covariance is small compared 
to 1/fl on that length scale. 

We will see shortly that if locally interacting spin or fermionic lattice systems are at 
a sufficiently high temperature, then the generalised covariance cov^^^j(a 4, H) between 
any to operators H, H G B{TL) decays exponentially with the graph distance d{A,B) 
between their supports. 


10.2. Clustering of correlations in high temperature thermal states 

The following theorem applies to all Hamiltonians whose interaction (hyper)graph has 
a finite growth constant. To explain what this means we need some additional notation. 
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A subset F <Z S of the edge set connects X and y if F contains all elements of some 
sequence of pairwise overlapping edges such that the hrst overlaps with X and the last 
overlaps with Y and similarly for sites x,y ^ V. A subset F C of the edge set S 
that connects all pa irs o f its e lements is called connected and connected subsets F are 
also called animals 404 . kobl l . The size |F| of an animal F is the number of edges it 


contains. It turns out that for many interesting (hyper)graphs the number of animals of 
a given size that contain a given edge grows exponentially with the size, but not faster. 
That is, they have a hnite growth constant. More precisely, the growth constant of a 
(hyper)graph Q = (V, is the smallest constant a satisfying 


V/c e Z^: sup \ {F C S connected: X G F A |F| = fc}| < a* 
vef 


(161) 


For example, the growth constant a of the interaction graph of nearest neighbour 
Hamilt onian s on D dimensional cubic lattices can be bounded by 2 F e (see Lem ma 2 
in Ref. 4n4| ). Moreover, there is a hnite growth constant a for any regular lattice 405], 


and there exist upper bounds on the growth constants of so-called spread-out graphs 


404] that make it possible to bound the growth consta nt of the interaction hypergraphs 

Where /-local fc-body on 


396 


of all /-local /c-body Hamiltonians on regular lattices 
a regular lattice means that V can be mapped onto the sites of a regular lattice such 
that S contains only subsystems which consist of at most k sites that are all contained 
in a ball (measured in the graph distance of the regular lattice) of diameter /. Apart 
from all /-local /c-body Hamiltonians on regular lattices this also makes the following 
results indirectly applicable to systems with exponentially decaying interactions (such 
Hamiltonians can be exponentially well approximated by /-local A:-body Hamiltonians) 
but not to Hamiltonians with algebraically decaying interactions, such as for example 
Coulomb or dipole interactions. We can now state the clustering of correlations result: 

Theorem 16 (Clustering of correlations at high temperature 


396. Theorem 2 and 


4]). Consider a locally interacting system of spins or fermions with Hilbert space Ti 
and Hamiltonian H G 0{'H) with local interaction strength J := max^e^ ll^xIL 
and interaction (hyper)graph Q = (V, £^) with growth constant a. Define the critical 
temperature 

(I* := In ((1 + ^l + A/a)/2) /(2 J) (162) 

and the thermal correlation length 


e(/5) := Il/ln(ae2|/^l^(e2|^l^-l))| 


(163) 


Then, for every |/5| < (3*, parameter r G [0,1], and every two operators A, B E B{Ti) 

with d(A, F)>^(/3 ) |ln (ln(3) (1 — | ; 


M FIK ^ min(|Aa|, l^al) PIL PIL ^-YAB)/e(y) 

cov^(^)(A,F)|< in(3)(l-e-V«C)) 


(164) 


The above theorem implies that in thermal states above the critical temperature 
the correlations between any two A, B E B{'H) decay exponentially with their distance 
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d(y4, B). Importantly, the critical temperature 01621) is independent of global properties 
of H but only depends on the local interaction strength J and the growth constant a 
of its interaction (hyper)graph. 

In the context of this work, the most important implication of Theorem [16] is the 
following result, which proves stability of thermal states above the critical temperature 
against local perturbations. More precisely, it shows that changing the Hamiltonian of 
a locally interacting quantum system only outside of a subsystem B has only limited 
influence on how thermal states to temperatures above the critical temperature look like 
in the interior S' C H of i? if the distance between S and Bq is large enough; 


Theorem 17 (Universal locality at high temperatures 396], Corollary 2 and 5]). Let 
H be a Hamiltonian satisfying the conditions of Theorem lTR let /3* and ^(/3) be defined 
as in Eg. (11621) and Eg. fll63p . let \f3\ < (3*, and let S <Z B O V be subsystems with 
d{S,Ba)>m lln(ln(3)(l-e-V«U))/|^a|)|. Then 


T) 


(165) 


where u := 4 |Sa| \Bq\/ ln(3). 

If the conditions of the above theorem are met and the interior subsystem S is 
sufficiently far from the boundary Bg of B such that d(S', Bg) is large and hence the 
right hand side of Eq. fll65p small, then the reduced state g^[H]{(3) on S of the thermal 
state of H is almost independent of the terms of the Hamiltonian H that are not in the 
restricted Hamiltonian Hb- 

Theorem ini is not unexpected, but it is nevertheless remarkable that it can be 
shown in this generality for systems of both locally interacting spins and fermions. 
Even more so, because, as we have seen in the discussion of equilibration (Section [3|) 
and especially in the section on equilibration time scales, a major obstacle for improving 
the statements we were able to make is that it seems to be hard to use the structure 
of natural many-body Hamiltonians, namely that interactions are usually few body and 
often short range. Theorem [T7| is an instance of a result whose proof heavily relies on 
the locality structure of locally interacting Hamiltonians and is able to exploit their 
structure. 

It is interesting to plug in the numbers of a specihc model to see how physical 
the derived critical temperature is. As a concrete example consider the ferromagnetic 
two dimensional isotropic Ising Model without external held. The critical temperature 
of Theorem [16] and [IT] is = 2/ln((l + Ri 24.58, whereas 

the Curie temperature, i.e., the temperature at which the phase transition between 
the paramagnetic an d th e ferromagnetic phase happens is known to be l/(/9cJ) = 
2/ ln(l + \/2) ^ 2.27 jdObl . To put this into perspective however, it is worth noting that 
the above theor em s till improves upon previously known bounds like for example that 
implied by Ref. 40 1| . which yields l/{(3* J) = 124 and that it is a universal upper bound 
independent of details of the particular model. Given how difficult it is to calculate or 
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even bound critical temperatures in lattice models (both classical and quantum) and 
that good bounds are known only for very few models the existence of such a non-trivial 
and universal upper bound is quite remarkable. 

Besides being of fundamental interest, Theorem [TT] has some obvious computational 
implications: It implies that for all \f3\ < fl* reduced states of thermal states can 
be approximated with a computational cost independent of the system size and sub¬ 
exponential Jn__the reciprocal approximation error (polynomially for systems in one 
dimension) 
in Ref. 


396 


The proof of TheoremUniis based on a cluster expansion (see Lemma 6 
396 |) previously used in Ref. jdO^ to show that thermal states above a critical 
temperature can be approximated by so-called matrix product operators (MPOs). The 
subtleti es of this approximation are often misunderstood. For details see the appendix 
of Ref. 1^ . 


11. Conclusions 

In this review, we have elaborated on a question that is at the heart of the foundations of 
quantum statistical mechanics: This is the question of how pure states evolving unitarily 
according to the Schrodinger equation can give rise to a wealth of phenomena that can 
rightfully be called thermodynamic. Individual observables and entire subsystems have 
a tendency to evolve towards equilibrium values/states and then stay close to them 
for most times during the evolution or extended time intervals. It turns out that the 
equilibrium properties can be captured by suitable maximum entropy principles implied 
by quantum mechanical dynamics alone. If a part of the system can be naturally 
identified as a bath and its complement as a distinguished subsystem, a weak interaction 
naturally leads to decoherence in the energy eigenbasis, and under additional conditions 
even equilibration to a thermal state can be guaranteed. We have also discussed 
properties of thermal states in lattice systems and in particular elaborated on precise 
conditions under which correlations decay exponentially. We have also reviewed systems 
where an absence of thermalisation is anticipated and the role played by many-body 
localisation played in this context. 

Notions of information propagation as well as entanglement and correlation 
dynamics play key roles in processes of equilibration and thermalisation. Complementing 
these dynamical approaches, the immensely large dimension of the Hilbert space of 
composite quantum systems can also justify the applicability of statistical ensembles 
via typicality arguments. 

It goes without saying that we have only touched the tip of the iceberg: Many 
key questions had to be left unmentioned, despite the considerable length of the article. 
This is particularly regrettable with respect to the exciting experimental developments 
that have taken place over the recent years and now allow us to probe the questions at 
hand under remarkably precise conditions. The high degree of control offered by such 
experiments makes it possible to use them as quantum simulators assessing features 
quantitatively that are outside the scope of present analytical or numerical approaches. 
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At the same time, one reason for why many questions can not be satisfactory 
discussed here is that many key problems actually remain wide open, despite the 
enormous progress surveyed here. The question of what time scales are to be expected 
in equilibration is just as open as is a full understanding of thermalisation. And here 
the present review reveals its most important purpose: To serve as an invitation to this 
exciting held of research. 
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Appendix 

A. Remarks on the foundations of statistical mechanics 


In this appendix we briehy sketch the most inhuential canonical approaches towards 
the foundations of thermodynamics and statistical mechanics. We will roughly follow 
the historical development, but emphasise more the problems and shortcomings of the 
respective approaches rather then their undeniable success and ingenuity. 

Gontrary to the rest of this work, this appendix is rather superhcial. The main 
justihcation for the brevity is the existence of several comprehensive wor ks on the 
topic, in particular the review by Uffink Sand the book by Sklar but also 

Refs. 104 kool |410| and Ghapter 4 in Ref. (69|. Adding yet another work to this list 
simply seems superfluous and a detailed review of the history of statistical mechanics is 
beyond the scope of this work. Also, we will brush over many of the more subtle issues 
of the classical approaches, such as the interpretation of probability and the problem of 
comparing discrete and continuous measures. 

The intention of this appendix is to partially answer the legitimate question of a 
person already familiar with thermodynamics and statistical physics: “Why should I care 
about pure state quantum statistical mechanics? Weren’t all the foundational questions 
already solved in the works from the 19th and early 20th century?” As we will see, 
despite the numerous attempts and the great amount of work that has been put into 
establishing a convincing justihcation for the methods of statistical mechanics it has “not 
yet developed a set of generally accepted formal axioms” Q, or, as E. T. Jaynes j3n3| 
puts it: “There is no line of argument preceding from the laws of microscopic mechanics 
to macroscopic phenomena that is generally regarded by physicists as convincing in all 
respects.” 
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(a) Time reversal objection (Loschmidt) 



(b) Recurrence objection (Poincare, Zermelo) 
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Figure 6. (Reproducti on f rom Ref. |l2fll |i The time reversal objection, also k nown 
as Loschmidt’s paradox |4Ril |. but actually first published by William Thomson 417 |. 
states that it should not be possible to deduce time reversal asymmetric statements like 
the H-theorem, implied by the Boltzmann equation, from an underlying time reversal 
invariant theory. More explicitly, it argues that for any process that brings a system 
into an equilibrium state starting from a non-equilibrium situation, there exists an 
equally physically allowed reverse process that takes the system out of equilibrium. 
The initial state for that process is obtained from the equilibrium state by reversing 
all velocities (see Panel (a)). The recurrence objection, which is based on the Poincare 
recurrence theorem but was made explicit by Zermelo |418l |. states that Boltzmann’s 
H-theorem is in conflict with Hamiltonian dynamics, because it can be proven on very 
general grounds that all finite systems are recurrent, i.e., return arbitrarily close to 
their initial state after possibly very long times (see Panel (b)). 


A.l. Canonical approaches 


Thermodynamics was originally developed as a purely phenomenological theory. 
Prototypical for this era are the laws of Boyle-Mariotte and Gay-Lussac that state 
empirically observed relations between the volume, pressure, and temperature of gases. 

The more widespread acceptance of the atomistic hypothesis in the 18th century 
opened up t he w ay for a m icroscopic understandi ng o f such empiri^l facts. The works 
of Clausius 


4111. Maxwell 


412. 413 , Boltzmann 


4141. and Gibbs 111 in the second half 


of the 19th and the beginning of the 20t h cen t ury are often perceived as the inception 
of statistical mechanics (see also Refs, j^, 408, 415 |). In this section we review some of 
these early attempts to develop a deeper understanding of thermodynamics based on 
microscopic considerations. 


A. 1.1. Boltzmann and the H-Theorem One of Boltzmann’s arguably most important 
contributions to the development of statistical mechanics is h is de rivation of what is 
known today as the Boltzmann equation and his H-theorem 414| (se e als o the hrst 
chapter of Boltzmann’s book “Vor lesungen iiber Gastheorie. Bd. 1.” |41^ as well as 

Ref. ' ' 


69 


, Ghapter 4] and Ref. jdOSj). 


In his 1872 article |414| Boltzmann aims at showing that the Maxwell-Boltzmann 
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distribution is the equilibrium distribution of the speed of gas particles and that a gas 
with an initially different distribution must inevitably approach it. He tries to do this 
on the grounds of microscopic considerations and starts off from the prototypical model 
of the hard sphere gas. He takes for granted that in equilibrium the distribution of the 
particles should be “uniform” and that their speed distribution should be independent 
of the direction of movement. He assumes that the number of particles is large and 
introduces a continuously differentiable function he calls “distribution of state’ll, which 
is meant to approximate the (discrete) distribution of the speed of the particles. He 
then derives a differential equation for the temporal evolution of this function, known 
today as the Boltzmann equation. He also defines an entropy for the “distribution of 
state” and shows that it increases monotonically in time under the dynamics given by 
the Boltzmann equation, a statement he calls H-Theorem, after the letter H used for 
denoting the entropy. 

During the derivation he makes several approximations. Essential is his “S tofizahl 
Ansatz”, later dubbed the “hypothesis of molecular disorder” in Ref. j415l |. which 
explicitly breaks the time reversal invariance of classical mechanics. This breaking of the 
time reversal symmetry is responsible for the temporal increase of entropy. Naturally 
this assumption has been much criticised. Famous are the time reversal objection of 
William Tho mson and Loschmidt and the recurrence objection due to Poincare and 
Zermelo jdOsI ( see F igure El). The bottom line of this debate, also later acknowledged 
by Boltzmann 419| . is that any statement that implies the convergence of a finite 
system to a fixed equilibrium state/distribution in the limit of time going to infinity 
is incompatible with a time reversal invariant or recurrent microscopic theory. This is 
important for the notions of equilibration we discuss in Section [3l 


A. 1.2. Gibbs’ ensemble approach For many, Gibbs’ book “Elementary principles in 
statistical mechanics” jl| from 1902 marks the birth of modern statistical mechanics 
Central in Gibbs’ approach is the concept of an ensemble, which he describes as follows: 
“We may imagine a great number of systems of the same nature, but differing in the 
configurations and velocities which they have at a given instant (...) we may set the 
problem, not to follow a particular system through its succession of configurations, but 
to determine how the whole number of systems will be distributed among the various 
conceivable configurations and velocities at any required time [...]” 

In fact, the book then is not so much concerned with (non-equilibrium) dynamics, 
but rather with the calculation of statistical equilibrium averages. Gibbs considers 
systems whose phase space is, as in Hamiltonian mechanics, spanned by canonical 
coordinates and introduces the micro-canonical, canonical, and grand canonical 
ensemble for such systems. He assumes that the number of states is high enough 
such that a description with a, as he calls it, “structure function”, a kind of density 
of states, is possible. He shows how various thermodynamic relations for quantities such 


ji German original [414| : “Zustandsverteilung”. 
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as temperature and entropy can be reproduced from his ensembles, if these quantities 
are properly defined in terms of the structure function. 

Gibbs is mostly concerned with dehning recipes for the description of systems in 
equilibrium. He gives little insight into why the ensembles he proposes capture the 
physics of thermodynamic equilibrium or how and why systems equilibrate in the first 
place Q. Instead of addressing such foundational questions he is “contented with the 
more modest aim of deducing some of the more obvious propositions relating to the 
statistical branch of mechanics” ll|. 


A. 1.3. (Quasi-)ergodicity The ergodicity hypothesis was essentially born out of the 
incoh erent use of different interpretations of p robability by Boltzmann in his early work 


42n| and was formulated by him in Ref. 42 1| as follows: “The great irregularity of the 


thermal motion and the multitude of forces that act on a body make it probable that 
its atoms, due to the motion we call heat, traverse all positions and velocities which are 
compatible with the principle of [conservation of] enep 


ihrenfest in Ref. 



The concept of ergodicity 
who proposed the ergodic 


was made prominent by P. and T. 
foundations of statistical mechanics 

Roughly speaking, a system is called (guasi-)ergodic if it explores its phase space 
uniformly in the course of time for most initial states. Making precise what “uniformly”, 
“most”, and “in the course of time” mean in this context already constitutes a major 
challenge Q- However, if one is willing to believe that a system at hand is ergodic in 
an appropriate sense then it readily follows that (inhnite time) temporal averages of 
physical quantities in that system are (approximately and/or with “high probability”) 
equal to certain phase space averages, such as for example that given by the micro- 
canonical ensemble. 

The ergodic foundations of statistical mechanics are then roughly based on 
arguments along the following lines: Any physical measurement must be carried out 
during a finite time interval. What one actually observes is not an instantaneous value, 
but an average over this time span. The relevant time spans might seem short on a 
human time scale, but can at the same time be “close to inhnite” compared to the 
microscopic time scales. Think for example of the process of measuring the pressure in 
a gas container with a membrane. The moment of inertia of the membrane is much too 
large to observe the spikes in the force due to hits by individual particles. It is thus 
reasonable to assume that observations are well described by (inhnite time) averages of 
the corresponding quantities, which, if the system is quasi-ergodic, can be calculated by 
averaging in an appropriate way over phase space 


The arguably most striking objection against such reasoning is the following 


408 


If it were in fact true that all realistic measurements could legitimately be described as 
inhnite time averages, then the observation of any non-equilibrium dynamics, including 
the approach to equilibrium, would simply be impossible. The latter is manifestly not 

ft The English translation is taken from Ref. [^. 
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the case. 

Besides this issue of the “infinite time” averages and the other problems mentioned 
above it is extraordinarily difficult to show that a given system is (quasi-)ergodic. 
Despite the ground breaking works of Birkhoff and von Neumann on the concept of 
metric transitivity, Sinai’s work on dynamical billiards, and more recent approaches 
such as Khinchin’s ergodic theorem, the full problem still awaits solution Q. 


A. 1 . 4 . Jaynes’ maximum entropy approach Conceptually very different from the three 


previously discussed approaches is the work of Jaynes 303j. He fully embraces a 


subjective interpretation of probability and proposes to regard statistical physics as 
a “form of statistical inference rather than a physical theory”. He then introduces a 
maximum entropy principle. In short, the maximum entropy principle states that in 
situations where the existing knowledge is insufficient to make definite predictions the 
best possible predictions can be reached by finding the distribution of the state space of 
the system that maximises the (Shannon) entropy and is compati ble w ith the available 
knowledge. The principle is inspired by the work of Shannon 422| who, as Jaynes 


claims, had shown that the maximm n en tropy distribution is the one with the least 
bias towards the missing information 303|: “[The] maximum entropy distribution may 
be asserted for the positive reason that it is uniquely determined as the one which is 
maximally noncommitt al wi th respect to missing information.” 

Moreover, in Ref. (jOdl j. Jaynes shows in quite some generality that the “usual 
computational rules [as presented in Gibbs’ book [l|] are an immediate consequence of 
the maximum entropy principle”. In addition, he points out various other advantages 
of his subjective approach. For example that it makes predictions “only if the available 
information is sufficient to justify fairly strong opinions”, and that it can account for 
new information in a natural way. 

While Jaynes’ principle can be used to justify the methods of statistical mechanics 
it gives little insight into why and under which conditions these methods yield results 
that agree with experiments. In other words: The maximum entropy principle ensures 
that making predictions based on statistical mechanics is “best practice”, but does not 
explain why this “best practice” is often good enough. The question “Why does statistical 
mechanics work?” hence remains partially u nans wered. 

A last point of criticism is that Ref. jjOjI wor ks in a classical setting. While 
an extension to quantum mechanics is possible 304| the subjective interpretation of 
probability advertised by Jaynes is arguably less convincing or at le ast debat able in this 
setting, although this is of course to some extend a matter of taste 42311424| . Problems 


arise because mixed quantum states can be written as convex combinations of pure 
states in more than one way so that more complicated arguments are needed to identify 
the von Neumann entropy as the right entropy measure to be maximised. 
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A.2. Closing remarks 


Except for Jaynes subjective maximum entropy principle, all approaches we have 
discussed in this chapter differ in one important point from that advertised in the main 
part of this review: They are based on classical mechanics. The applicability of classical 
models to systems that behave thermodynamically is, however, questionable. 

Consider for example two of the most prominently used models in statistical 
mechanics: The hard sphere model for gases and the Ising model for ferromagnetism. 
The atoms and molecules of a gas, as well as the interactions between them, in principle 
require a quantum mechanical description. Yet, it is often claimed that in the so- 
called Ehrenfest limit, i.e., if the spread of the quantum mechanical wave packets of 
the individual particles is small compared to the “radius” of the particles, the classical 
hard sphere approximation is eligible. It can, however, be shown that under reasonable 
conditions systems typically leave the Ehrenfest limit on timescales much shorter than 


those of usual thermodynamic processes 69|, Chapter 4]. Moreover, whether the 


Ehrenfest limit constitutes a sufficient conditio n fo r the applicability of (semi-)classical 
approximations in the first place is debatable 425| . Similarly, the relevant elementary 
magnetic moments of a piece of iron, namely the electronic spins, are intrinsically 
quantum. In fact, it is known that classical physics alone cannot explain the phenomenon 
of ferromagnetism in a satisfactory way — a statement known as Bohr-van Leeuwen 
theorem 426| - |428] . The extremely simplified description employed in the Ising model 


can thus, despite its pedagogical value, arguably not capture all the relevant physics. 

In addition to this, there are many situations w here thermodynamic behaviour 
cannot be understood in a purely classical framework 42 9| : For example, black-body 
radiation cannot be understood without postulating a quantisation of energy to avoid 
the ultraviolet catastrophe. Further prime example for this are gases of indistinguishable 
particles. An application of classical physics leads to Cibbs’ paradox for the mixing 
entropy and the statistics of Bose and Fermi gases at low temperatures cannot be 
explained classically. Last but not least, the “freezing out” of certain internal degrees of 
freedom of molecular gases, which impacts their heat capacities, cannot be understood 
in a convincing way from classical physics alone. 
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